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* Preface * 


This book is about the use of digital computers in the real-time control of dynamic 
systems such as servomechanisms, chemical processes, and vehicles that move 
over water, land, air, or space. The material requires some understanding of 
the Laplace transform and assumes that the reader has studied linear feedback 
controls. The special topics of discrete and sampled-data system analysis are 
introduced, and considerable emphasis is given to the z-transform and the close 
connections between the z-transform and the Laplace transform. 

The book's emphasis is on designing digital controls to achieve good dy- 
namic response and small errors while using signals that are sampled in time 
and quantized in amplitude. Both transform (classical control) and state-space 
(modern control) methods are described and applied to illustrative examples. The 
transform methods emphasized are the root-locus method of Evans and frequency 
response. The root-locus method can be used virtually unchanged for the discrete 
case; however, Bode's frequency response methods require modification for use 
with discrete systems. The state-space methods developed are the technique of 
pole assignment augmented by an estimator (observer) and optimal quadratic- 
loss control. The optimal control problems use the steady-state constant-gain 
solution; the results of the separation theorem in the presence of noise are stated 
but not proved. 

Each of these design methods—classical and modern alike—has advantages 
and disadvantages, strengths and limitations. It is our philosophy that a designer 
must understand all of them to develop a satisfactory design with the least effort. 

Closely related to the mainstream of ideas for designing linear systems that 
result in satisfactory dynamic response are the issues of sample-rate selection, 
model identification, and consideration of nonlinear phenomena. Sample-rate 
selection is discussed in the context of evaluating the increase in a least-squares 
performance measure as the sample rate is reduced. The topic of model making is 
treated as measurement of frequency response, as well as least-squares parameter 
estimation. Finally, every designer should be aware that all models are nonlinear 
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and be familiar with the concepts of the describing functions of nonlinear systems, 
methods of studying stability of nonlinear systems, and the basic concepts of 
nonlinear design. 

Material that may be new to the student is the treatment of signals which are 
discrete in time and amplitude and which must coexist with those that are con- 
tinuous in both dimensions. The philosophy of presentation is that new material 
should be closely related to materal already familiar, and yet, bythe end, indicate 
a direction toward wider horizons. This approach leads us, for example, to relate 
the z-transform to the Laplace transform and to describe the implications of poles 
and zeros in the z-plane to the known meanings attached to poles and zeros in 
the s-plane. Also, in developing the design methods, we relate the digital control 
design methods to those of continuous systems. For more sophisticated methods, 
we present the elementary parts of quadratic-loss Gaussian design with minimal 
proofs to give some idea of how this powerful method is used and to motivate 
further study of its theory. 

The use of computer-aided design (CAD)is universal for practicing engineers 
in this field, as in most other fields. We have recognized this fact and provided 
guidance to the reader so that learning the controls analysis material can be 
integrated with learning how to compute the answers with MATLAB, the most 
widely used CAD software package in universities. In many cases, especially in 
the earlier chapters, actual MATLAB scripts are included in the text to explain how 
to carry out a calculation. In other cases, the MATLAB routine is simply named for 
refaence. All the routines given are tabulated in Appendix E for easy refaence; 
therefore, this book can be used as a reference for learning how to use MATLAB 
in control calculations as well as for control systems analysis. In short, we have 
tried to describe the entire process, from learning the concepts to computing the 
desired results. But we hastento add that it is mandatory that the student retain the 
ability to compute simple answers by hand so that the computer's reasonableness 
can be judged. The First Law of Computers for engineers remains "Garbage In, 
Garbage Out.” 

Most of the graphical figures in this third edition were generated using 
MATLAB? supplied by The Mathworks, Inc. The files that created the figures are 
available from Ellis-Kagle Press at www.elliskaglepress.com. The 
reader is encouraged to use these MATLAB figure files as an additional 
guide in learning how to perform the various calculations. 

To review the chapters briefly: Chapter 1 contains introductory comments. 
Chapters 2 and 3 are new to the third edition. Chapter 2 is a review of the 
prerequisite continuous control; Chapter 3 introduces the key effects of 
sampling in order to elucidate many of the topics that follow. Methods of linear 
analysis are presented in Chapters 4 through 6. Chapter 4 presents the 
z-transform. Chapter 5 introduces combined discrete and continuous systems, 
the sampling theorem, and the phenomenon of aliasing. 
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Chapter 6 shows methods by which to generate discrete equations that will 
approximate continuous dynamics. The basic deterministic design methods are 
presented in Chapters 7 and 8—the root-locus and frequency response methods 
in Chapter 7 and pole placement and estimators in Chapter 8. The state-space 
material assumes no previous acquaintance with the phase plane or state space, 
and the necessary analysis is developed from the ground up. Some familiarity 
with simultaneous linear equations and matrix notation is expected, and a few 
unusual or more advanced topics such as eigenvalues, eigenvectors, and the 
Cayley-Hamilton theorem are presented in Appendix C. Chapter 9 introduces 
optimal quadratic-loss control: First the control by state feedback is presented 
and then the estimation of the state in the presence of system and 
measurement noise is developed, based on a recursive least-squares estimation 
derivation. 

In Chapter 10 the nonlinear phenomenon of amplitude quantization and 
its effects on system error and system dynamic response are studied. Chap-^ 
ter 11 presents methods of analysis and design guidelines for the selection of, 
the sampling period in a digital control system. It utilizes the design methods 
discussed in Chapters 7, 8, and 9, in examples illustrating the effects of sample 
rate. Chapter 12 introduces both nonparametric and parametric identification. 
Nonparametric methods are based on spectral estimation. Parametric methods 
are introduced by starting with deterministic least squares, introducing random 
errors, and completing the solution with an algorithm for maximum likelihood. 
Sub-space methods are also introduced for estimating the state matrices directly. 
Nonlinear control is the subject of Chapter 13, including examples of plant non- 
linearities and methods for the analysis and design of controllers for nonlinear 
models. Simulation, stability analysis, and performance enhancement by non- 
linear controllers and by adaptive designs are also included in Chapter 13. The 
chapter ends with a nonlinear design optimization alternative to the techniques 
presented in Chapter 9. The final chapter, 14, is a detailed design example of a 
digital servo for a disk drive head. Chapters 13 and 14 are not included in this 
ePUB edition; however, they are available by request to elliskagle& gmail.com. 

For purposes of organizing a course, Fig. P.1l shows the dependence of 
material in each chapter on previous chapters. By following the solid lines, the 
reader will have all the background required to understand the material in a 
particular chapter, even if the path omits some chapters. Furthermore, sections 
with a star(*) are optional and may be skipped with no loss of continuity. 
Chapters may also be skipped, as suggested by the dashed lines, if the reader is 
willing to take some details on faith; however, the basic ideas of the later 
chapters will be understood along these paths. 

The first seven chapters (skipping or quickly reviewing Chapter 2) constitute 
a comfortable one-quarter course that would follow a course in continuous 
linear control using a text such as Franklin, Powell, and Emami-Naeini (2019). 
For a one-semester course, the first eight chapters represent a comfortable load. 
The 
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3rd Edition 3rd eEdition 
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Introduction 1 1 
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content of a second course has many possibilities. One possibility is to combine 
Chapters 8 and 9 with Chapter 10, 11, or 12. As can be seen from the figure, 
many options exist for including the material in the last five chapters. For a full- 
year course, all fourteen chapters can be covered. One of the changes made in 
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this third edition is that the optimal control material no longer depends on the least- 
squares development in the system identification chapter, thus allowing for more 
flexibility in the sequence of teaching. 

It has been found at Stanford that it is very useful to supplement the 
lectures with laboratory work to enhance learning. A very satisfactory 
complement of laboratory equipment is a digital computer having an A/D and a D/A 
converter, an analog computer (or equivalent) with ten operational 
amplifiers, a digital storage scope, and a CAD package capable of performing the 
basic computations and plotting graphs as described in [Franklin, Powell (1989)]. 

There are many important topics in control that we have not been able to include 
in this book. There is, for example, no discussion of mu analysis or design, 
linear matrix inequalities, or convex optimization. It is our expectation, however, that 
careful study of this book will provide the student engineer with a sound basis for 
design of sampled-data controls and a foundation for the study of these and many 
other advanced topics in this most exciting field. 

We wish to acknowledge the vast array of contributors on whose work our own 

presentation is based. The list of references gives some indication of those to whom 

we are in debt. On a more personal level, we wish to express our appreciation 
to Profs. S. Boyd, A. Bryson, R. Cannon, S. Citron, J. How, and S. Rock for their 
valuable suggestions for the book and especially to our long-time colleague, 
Prof. Dan DeBra, for his careful reading and many spirited suggestions. We 
also wish to express our appreciation for many valuable suggestions to the 
current and former students of E207 and E208, for whom this book was written. 

In addition, we want to thank the following people for their helpful 
reviews of the manuscript: Fred Bailey, University of Minnesota; John Fleming, 
Texas A&M University; J.B. Pearson, Rice University; William Perkins, 
University of Illinois; James Carroll, Clarkson University; Walter Higgins, Jr., 
Arizona State University; Stanley Johnson, Lehigh University; Thomas Kurfess, 
Georgia Institute of Technology; Stephen Phillips, Case Western Reserve 
University; Chris Rahn, Clemson University; T. Srinivasan, Wilkes University; 
Hal Tharp, University of Arizona; Russell Trahan, Jr., University of New 
Orleans; and Gary Young, Oklahoma State University. We also wish to 
express our appreciation to Tom Robbins for his help with all the editions of the 
book, and to Laura Cheu, Emilie Bauer, and all the staff at Addison-Wesley for their 
quality production of this 3rd edition. 


Stanford, California 
GFE 
J.D.P. 
M.L.W. 
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Introduction 


1.1 


A Perspective on Digital Control 


The control of physical systems with a digital computer or microcontroller is 
becoming more and more common. Examples of electromechanical servomech- 
anisms exist in aircraft, automobiles, mass-transit vehicles, oil refineries, and 
paper-making machines. Furthermore, many new digital control applications are 
being stimulated by microprocessor technology including control of various as- 
pects of automobiles and household appliances. Among the advantages of digital 
approaches for control are the increased flexibility of the control programs and 
the decision-making or artificial intelligence capability of digital systems, 
which can be combined with the dynamic control function to meet other system 
requirements. In addition, one hardware design can be used with many 
different software variations on a broad range of products, thus simplifying and 
reducing the design time. 


Chapter Overview 


In Section 1.1, you will learn about what a digital control system is, what the 
typical structure is, and what the basic elements are. The key issues are 
discussed and an overview of where those issues are discussed in the book is 
given. Section 1.2 discusses the design approaches used for digital control 
systems and provides an overview of where the different design approaches 
appear in the book. Computer Aided Control System Design (CACSD) issues 
and how the book’s authors have chosen to handle those issues are discussed in 
Section 1.3. 


Problem Definition 


The digital controls studied in this book are for closed-loop (feedback) systems 
in which the dynamic response of the process being controlled is a major con- 
sideration in the design. A typical structure of the elementary type of system 
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2 Chapter 1 


Figure 1.1 


Block diagram of a basic 
digital control system 


Introduction 


that will occupy most of our attention is sketched schematically in Fig. 1.1. This 
figure will help to define our basic notation and to introduce several features that 
distinguish digital controls from those implemented with analog devices. The 
process to be controlled (sometimes referred to as the plant) may be any of the 
physical processes mentioned above whose satisfactory response requires control 
action. 

By "satisfactory response” we mean that the plant output, y(t), is to be forced 
to follow or track the reference input, r(t), despite the presence of disturbance 
inputs to the plant [w(f) in Fig. 1.1] and despite errors in the sensor [v(t) in 
Fig. 1.1]. It is also essential that the tracking succeed even if the dynamics of 
the plant should change somewhat during the operation. The process of holding 
y(t) close to r(t), including the case where r = 0, is referred to generally as 
the process of regulation. A system that has good regulation in the presence of 
disturbance signals is said to have good disturbance rejection. A system that 
has good regulation in the face of changes in the plant parameters is said to have 
low sensitivity to these parameters. A system that has both good disturbance 
rejection and low sensitivity we call robust. 


w(t) 


(kT) + Digital | uan | 


computer NK JA 


v(t) 
Notation: 
r = reference or command inputs 
u = control or actuator input signal 
y = controlled or output signal 
9 = instrument or sensor output, usually an approximation to or estimate 


of y. (For any variable, say 0, the notation Ó is now commonly taken 
from statistics to mean an estimate of 6.) 


6 = r—j - indicated error 

€ = r—y = System error 

w  * disturbance input to the plant 

v = disturbance or noise in the sensor 
A/D = analog-to-digital converter 

D/A = digital-to-analog converter 
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The means by which robust regulation is to be accomplished is through the 
control inputs to the plant [u(t) in Fig. 1.1]. It was discovered long ago! that 
a scheme of feedback wherein the plant output is measured (or sensed) and 
compared directly with the reference input has many advantages in the effort to 
design robust controls over systems that do not use such feedback. Much of our 
effort in later parts of this book will be devoted to illustrating this discovery and 
demonstrating how to exploit the advantages of feedback. However, the problem 
of control as discussed thus far is in no way restricted to digital control. For that 
we must consider the unique features of Fig. 1.1 introduced by the use of a digital 
device to generate the control action. 

We consider first the action of the analog-to-digital (A/D) converter on a 
signal. This device acts on a physical variable, most commonly an electrical 
voltage, and converts it into a stream of numbers. In Fig. 1.1, the A/D converter 
acts on the sensor output and supplies numbers to the digital computer. It is 
common for the sensor output, y, to be sampled and to have the error formed in 
the computer. We need to know the times at which these numbers arrive if we are 
to analyze the dynamics of this system. 

In this book we will make the assumption that all the numbers arrive with the 
same fixed period T, called the sample period. In practice, digital control sys- 
tems sometimes have varying sample periods and/or different periods in different 
feedback paths. Usually there is a clock as part of the computer logic which sup- 
plies a pulse or interrupt every T seconds, and the A/D converter sends a number 
to the computer each time the interrupt arrives. An alternative implementation is 
simply to access the A/D upon completion of each cycle of the code execution, 
a scheme often referred to as free running. A further alternative is to use some 
other device to determine a sample, such as an encoder on an engine crankshaft 
that supplies a pulse to trigger a computer cycle. This scheme is referred to as 
event-based sampling. In the first case the sample period is precisely fixed; in 
the second case the sample period is essentially fixed by the length of the code, 
providing no logic branches are present that could vary the amount of code ex- 
ecuted; in the third case, the sample period varies with the engine speed. Thus 
in Fig. 1.1 we identify the sequence of numbers into the computer as e(k T). We 
conclude from the periodic sampling action of the A/D converter that some of 
the signals in the digital control system, like (kT), are variable only at discrete 
times. We call these variables discrete signals to distinguish them from variables 
like w and y, which change continuously in time. A system having both discrete 
and continuous signals is called a sampled-data system. 

In addition to generating a discrete signal, however, the A/D converter also 
provides a quantized signal. By this we mean that the output of the A/D 
converter must be stored in digital logic composed of a finite number of digits. 
Most commonly, ofcourse, the logic is based on binary digits (i.e., bits) composed 


] See especially the book by Bode (1945). 
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Figure 1.2 

Plot of output versus 
input characteristics of 
the A/D converter 


of 0’s and 1’s, but the essential feature is that the representation has a finite number 
of digits. A common situation is that the conversion of y to y is done so that y 
can be thought of as a number with a fixed number of places of accuracy. If we 
plot the values of y versus the resulting values of y we can obtain a plot like that 
shown in Fig. 1.2. We would say that y has been truncated to one decimal place, 
or that y is quantized with a q of 0.1, since y changes only in fixed quanta of, 
in this case, 0.1 units. (We will use g for quantum size, in general.) Note that 
quantization is a nonlinear function. A signal that is both discrete and quantized 
is called a digital signal. Not surprisingly, digital computers in this book process 
digital signals. 

In a real sense the problems of analysis and design of digital controls are 
concerned with taking account of the effects of the sampling period T and the 
quantization size q. If both T and q are extremely small (sampling frequency 
30 or more times the system bandwidth with a 16-bit word size), digital signals 
are nearly continuous, and continuous methods of analysis and design can be 
used. The resulting design could then be converted to the digital format for 
implementation in a computer by using the simple methods described in Chapter 3 
or the emulation method described in Chapter 7. We will be interested in this text 
in gaining an understanding of the effects of all sample rates, fast and slow, and the 
effects of quantization for large and small word sizes. Many systems are originally 
conceived with fast sample rates, and the computer is specified and frozen early 
in the design cycle; however, as the designs evolve, more demands are placed on 
the system, and the only way to accommodate the increased computer load is to 
slow down the sample rate. Furthermore, for cost-sensitive digital systems, the 
best design is the one with the lowest cost computer that will do the required job. 
That translates into being the computer with the slowest speed and the smallest 
word size. We will, however, treat the problems of varying T and q separately. 
We first consider q to be zero and study discrete and sampled-data (combined 
discrete and continuous) systems that are linear. In Chapter 10 we will analyze 


aliasing 
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in more detail the source and the effects of quantization, and we will discuss in 
Chapters 7 and 11 specific effects of sample-rate selection. 

Our approach to the design of digital controls is to assume a background 
in continuous systems and to relate the comparable digital problem to its con- 
tinuous counterpart. We will develop the essential results, from the beginning, 
in the domain of discrete systems, but we will call upon previous experience 
in continuous-system analysis and in design to give alternative viewpoints and 
deeper understanding of the results. In order to make meaningful these references 
to a background in continuous-system design, we will review the concepts and 
define our notation in Chapter 2. 


Overview of Design Approach 


An overview of the path we plan to take toward the design of digital controls 
will be useful before we begin the specific details. As mentioned above, we 
place systems of interest in three categories according to the nature of the signals 
present. These are discrete systems, sampled-data systems, and digital systems. 

In discrete systems all signals vary at discrete times only. We will analyze 
these in Chapter 4 and develop the z-transform of discrete signals and “pulse’- 
transfer functions for linear constant discrete systems. We also develop discrete 
transfer functions of continuous systems that are sampled, systems that are called 
sampled-data systems. We develop the equations and give examples using both 
transform methods and state-space descriptions. Having the discrete transfer 
functions, we consider the issue of the dynamic response of discrete systems. 

A sampled-data system has both discrete and continuous signals, and often 
it is important to be able to compute the continuous time response. For example, 
with a slow sampling rate, there can be significant ripple between sample instants. 
Such situations are studied in Chapter 5. Here we are concerned with the question 
of data extrapolation to convert discrete signals as they might emerge from a 
digital computer into the continuous signals necessary for providing the input to 
one of the plants described above. This action typically occurs in conjunction 
with the D/A conversion. In addition to data extrapolation, we consider the 
analysis of sampled signals from the viewpoint of continuous analysis. For this 
purpose we introduce impulse modulation as a model of sampling, and we use 
Fourier analysis to give a clear picture for the ambiguity that can arise between 
continuous and discrete signals, also known as aliasing. The plain fact is that 
more than one continuous signal can result in exactly the same sample values. If 
a sinusoidal signal, y,, at frequency f, has the same samples as a sinusoid y, of a 
different frequency f,, y, is said to be an alias of y,. A corollary of aliasing is the 
sampling theorem, which specifies the conditions necessary if this ambiguity is 
to be removed and only one continuous signal allowed to correspond to a given 
set of samples. 
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Introduction 


As a special case of discrete systems and as the basis for the emulation 
design method, we consider discrete equivalents to continuous systems, which 
is one aspect of the field of digital filters. Digital filters are discrete systems 
designed to process discrete signals in such a fashion that the digital device (a 
digital computer, for example) can be used to replace a continuous filter. Our 
treatment in Chapter 6 will concentrate on the use of discrete filtering techniques 
to find discrete equivalents of continuous-control compensator transfer functions. 
Again, both transform methods and state-space methods are developed to help 
understanding and computation of particular cases of interest. 

Once we have developed the tools of analysis for discrete and sampled 
systems we can begin the design of feedback controls. Here we divide our tech- 
niques into two categories: transform’ and state-space’ methods. In Chapter 7 
we study the transform methods of the root locus and the frequency response 
as they can be used to design digital control systems. The use of state-space 
techniques for design is introduced in Chapter 8. For purposes of understanding 
the design method, we rely mainly on pole placement, a scheme for forcing the 
closed-loop poles to be in desirable locations. We discuss the selection of the 
desired pole locations and point out the advantages of using the optimal control 
methods covered in Chapter 9. Chapter 8 includes control design using feedback 
of all the "state variables" as well as methods for estimating the state variables 
that do not have sensors directly on them. In Chapter 9 the topic of optimal con- 
trol is introduced, with emphasis on the steady-state solution for linear constant 
discrete systems with quadratic loss functions. The results are a valuable part 
of the designer’s repertoire and are the only techniques presented here suitable 
for handling multivariable designs. A study of quantization effects in Chapter 10 
introduces the idea of random signals in order to describe a method for treating 
the "average" effects of this important nonlinearity. 

The last four chapters cover more advanced topics that are essential for most 
complete designs. The first of these topics is sample rate selection, contained 
in Chapter 11. In our earlier analysis we develop methods for examining the 
effects of different sample rates, but in this chapter we consider for the first time 
the question of sample rate as a design parameter. In Chapter 12, we introduce 
system identification. Here the matter of model making is extended to the use 
of experimental data to verify and correct a theoretical model or to supply a 
dynamic description based only on input-output data. Only the most elementary 
of the concepts in this enormous field can be covered, of course. We present the 
method of least squares and some of the concepts of maximum likelihood. 

In Chapter 13, an introduction to the most important issues and techniques 
for the analysis and design of nonlinear sampled-data systems is given. The 


2 Named because they use the Laplace or Fourier transform to represent systems. 

3 The state space is an extension of the space of displacement and velocity used in physics. Much that 
is called modern control theory uses differential equations in state-space form. We introduce this 
representation in Chapter 4 and use it extensively afterwards, especially in Chapters 8 and 9. 
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analysis methods treated are the describing function, equivalent linearization, and 
Lyapunov's second method of stability analysis. Design techniques described are 
the use of inverse nonlinearity, optimal control (especially time-optimal control), 
and adaptive control. Chapter 14 includes a case study of a disk-drive design, and 
treatment of both implementation and manufacturing issues is discussed. 


Computer-Aided Design 


As with any engineering design method, designof control systems requires many 

computations that are greatly facilitated by a good library of well-documented 
computer programs. [n designing practical digital control systems, and especially 

in iterating through the methods many times to meet essential specifications, an 
interactive computer-aided controlsystem design (CACSD) package with simple 

access to plotting graphics is crucial. Many commercial control system CACSD 

packages are available which satisfy that need, MATLAB? being the most 

popular one. Much of the discussion in the book assumes that a designer has 
access to one of the CACSD products. Specific MATLAB routines that can be 

used for performing calculations are indicated throughout the text and in 

some cases the full MATLAB command sequence is shown. All the graphical 

figures were developed using MATLAB and the files that created them are 

confained in the Digital Control Toolbox which is available on the Web at no 

charge. Files based on MATLAB with Control System Toolbox are available 
at elliskaglepress.com. These figure files should be helpful in 
understanding the specifics on how to do a calculation and are an important 
augmentation to the book's examples. 

CACSD support for a designer is now universal; however, it is essential 
that the designer is able to work out very simple problems by hand in order to 
have some idea about the reasonableness of the computer's answers. Having the 
knowledge of doing the calculations by hand is also critical for identifying 
trends that guide the designer; the computer can identify problems but the 
designer must make intelligent choices in guiding the refinement of the 
computer design. 


Suggestions for Further Reading 


Several histories of feedback control are readily available, including a Scientific 
American Book (1955), and the study of Mayr (1970). A good discussion of 
the historical developments of control is given by Dorf (1980) and by Fortmann 
and Hitz (1977), and many other references are cited by these authors for the 
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interested reader. One of the earliest published studies of control systems operat- 
ing on discrete time data (sampled-data systems in our terminology) is given by 
Hurewicz in Chapter 5 of the book by James, Nichols, and Phillips (1947). 
Another is by Ragazzini and Franklin (1958). 

The ideas of tracking and robustness embody many elements of the 
objectives of control system design. The concept of tracking contains the 
requirements of system stability, good transient response, and good 
steady-state accuracy, all concepts fundamental to every control system. 
Robustness is a property essential to good performance in practical designs 
because real parameters are subject to change and because external, unwanted 
signals invade every system. Discussion of performance specifications of 
control systems is given in most books on introductory control, including 
Franklin, Powell, and Emami-Naeini (2019). We will study these matters in 
later chapters with particular reference to digital control design. 

To obtain a firm understanding of dynamics, we suggest a comprehensive 
text by Cannon (1967) or chapter 2 in Franklin, Powell, and Emami-Naeini 
(2019). They are concerned with writing the equations of motion of physical 
systems in a form suitable for control studies. 


Summary 


e In adigital control system, the analog electronics used for compensation in a 
continuous system is replaced with a digital computer or microcontroller, an 
analog-to-digital (A/D) converter, and a digital-to-analog (D/A) converter. 


* Design of a digital control system can be accomplished by transforming a 
continuous design, called emulation, or designing the digital system directly. 
Either method can be carried out using transform or state-space system 
description. 

* The design of a digital control system includes determining the effect of the 
sample rate and selecting a rate that is sufficiently fast to meet all specifica- 
tions. 


e Most designs today are carried out using computer-based methods; however 
the designer needs to know the hand-based methods in order to intelligently 
guide the computer design as well as to have a sanity check on its results. 


1.6 Problems 


1.1 Suppose a radar search antenna at the San Francisco airport rotates at 6 rev/min, and data 
points corresponding to the position of flight 1081 are plotted on the controller's screen 
once per antenna revolution. Flight 1081 is traveling directly toward the airport at 540 
mi/hr. A feedback control system is established through the controller who gives course 
corrections to the pilot. He wishes to do so each 9 mi of travel of the aircraft, and his 
instructions consist of course headings in integral degree values. 
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(a) What is the sampling rate, in Hz, of the range signal plotted on the radar 
Screen? 


(b) What is the sampling rate, in Hz, of the controller's instructions? 
(c) Identify the following signals as continuous, discrete, or digital: 
i. the aircraft's range from the airport, 
ii. the range data as plotted on the radar screen, 
iii. the controller’s instructions to the pilot, 
iv. the pilot’s actions on the aircraft control surfaces. 
(d) Is this a continuous, sampled-data, or digital control system? 


(e) Show that it is possible for the pilot of flight 1081 to fly a zigzag course which 
would show up as a straight line on the controller’s screen. What is the (lowest) 
frequency of a sinusoidal zigzag course which will be hidden from the controller’s 
radar? 


If a signal varies between 0 and 10 volts (called the dynamic range) and it is required 
that the signal must be represented in the digital computer to the nearest 5 millivolts, 
that is, if the resolution must be 5 mv, determine how many bits the analog-to-digital 
converter must have. 


Describe five digital control systems that you are familiar with. State what you think the 
advantages of the digital implementation are over an analog implementation. 


Historically, house heating system thermostats were a bimetallic strip that would make 
or break the contact depending on temperature. Today, most thermostats are digital. 
Describe how you think they work and list some of their benefits. 


Use MATLAB (obtain a copy of the Student Edition or use what's available to you) and 
plot y vs x for x = 1 to 10 where y = x°. Label each axis and put a title on it. 


Use MATLAB (obtain a copy of the Student Edition or use what's available to you) and 
make two plots (use MATLAB's subplot) of y vs x for x — 1 to 10. Put a plot of y — 3* 
on the top of the page and y = ./x on the bottom. 
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A Perspective on the Review of Continuous Control 


The purpose of this chapter is to provide a ready reference source of the material 
that you have already taken in a prerequisite course. The presentation is not 
sufficient to learn the material for the first time; rather, it is designed to state 
concisely the key relationships for your reference as you move to the new material 
in the ensuing chapters. For a more in-depth treatment of any of the topics, see 
an introductory control text such as Feedback Control of Dynamic Systems, by 
Franklin, Powell, and Emami-Naeini (2019). 


Chapter Overview 


The chapter reviews the topics normally covered in an introductory controls 
course: dynamic response, feedback properties, root-locus design, frequency re- 
sponse design, and state-space design. 


Dynamic Response 


In control system design, it is important to be able to predict how well a trial 
design matches the desired performance. We do this by analyzing the equations 
of the system model. The equations can be solved using linear analysis approxi- 
mations or simulated via numerical methods. Linear analysis allows the designer 
to examine quickly many candidate solutions in the course of design iterations 
and is, therefore, a valuable tool. Numerical simulation allows the designer to 
check the final design more precisely including all known characteristics and is 
discussed in Section 13.2. The discussion below focuses on linear analysis. 
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2.1.1 


state-variable form 


state 


2.1.2 


Differential Equations 


Linear dynamic systems can be described by their differential equations. Many 
systems involve coupling between one part of a system and another. Any set of 
differential equations of any order can be transformed into a coupled set of first- 
order equations called the state-variable form. So a general way of expressing 
the dynamics of a linear system is 


x = Fx + Gu (2.1) 
y = Hx + Ju. (2.2) 


where the column vector x is called the state of the system and contains n 
elements for an nth-order system, u is the m x 1 input vector to the system, y is 
the p x 1 output vector, F is ann x n system matrix, G is ann x m input matrix, 
H is a p x n output matrix, and J is p x m.' Until Chapter 9, all systems will 
have a scalar input, u, and a scalar output y; in this case, Gisn x 1, His 1 x n, 
and J is a scalar. 

Using this system description, we see that the second-order differential 
equation 


Ï +w, Y + wy = Ku, (2.3) 


can be written in the state-variable form as 


X, - 0 1 X; 0 
sl melle e 
y=[l 0115 . 
X, 


where the state 


is the vector of variables necessary to describe the future behavior of the system, 
given the initial conditions of those variables. 


Laplace Transforms and Transfer Functions 


The analysis of linear systerns is facilitated by use of the Laplace transform. The 
most important property of the Laplace transform (with zero initial conditions) 
is the transform of the derivative of a signal. 


L{f (t)} = s F(s). (2.5) 


1 It is also common to use A, B, C, D in place of F, G, H, J as MATLAB does throughout. We prefer to 
use F, G ... for a continuous plant description, A, B ... for compensation, and ®, I’... for the discrete 
plant description in order to delineate the various system equation usages. 
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This relation enables us to find easily the transfer function, G(s), of a linear 
continuous system, given the differential equation of that system. So we see that 
Eq. (2.3) has the transform 


(s? + 2¢w,5 + a) Y (s) = K,U(s), 
and, therefore, the transfer function, G(s), is 
_ Y(s) K 


o 


G(s) = -= ———————— 
(s) U(s) s +205 +0? 


CACSD software typically accepts the specification of a system in either the 
state-variable form or the transfer function form. The quantities specifying the 
state-variable form (Eqs. 2.1 and 2.2) are F, G, H, and J. This is referred to 
as the "ss" form in MATLAB. The transfer function is specified in a polynomial 


form (“tf”) or a factored zero-pole-gain form (“zpk”). The transfer function in 
polynomial form is 


bs" + b,s" +- +b 


G(s) = n n-l : 
as bas bea 


(2.6) 


where the MATLAB quantity specifying the numerator is a 1 x (m + 1) matrix of 
the coefficients, for example 


num = lb, b, basil 


and the quantity specifying the denominator is a 1 x (n + 1) matrix, for example 
denz[a, a, 1% Qn, i) 


In MATLAB with Control System Toolbox the numerator and denominator are 
combined into one system specification with the statement 
sys = tf(num,den). 


In the zero-pole-gain form, the transfer function is written as the ratio of two 
factored polynomials, 
II, (s — Ži ) 


G(s) = K , 
Ec NE 


(2.7) 
and the quantities specifying the transfer function are an m x | matrix of the 
zeros, an n x | matrix of the poles, and a scalar gain, for example 

Zi Py 


am| |, gel. ex 


m P n 
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and can be combined into a system description by 
sys = zpk(z,p,k). 


For the equations of motion of a system with second-order or higher equa- 
tions, the easiest way to find the transfer function is to use Eq. (2.5) and do the 
math by hand. If the equations of motion are in the state-variable form and the 
transfer function is desired, the Laplace transform of Eqs. (2.1) and (2.2) yields 


G(s) = 29) H(sI—F)"'G+ J. 
u(s) 


In MATLAB, given F, G, H, and J, one can find the polynomial transfer function 
form by the MATLAB script 


sys = tf(ss(F,G,H,J)) 
or the zero-pole-gain form by 
sys = zpk(ss(F,G,H,J)). 
Likewise, one can find a state-space realization of a transfer function by 


sys = ss(tf(num,den)). 


Output Time Histories 


Given the transfer function and the input, u(t), with the transform U(s), the 
output is the product, 


Y(s) = G(s)U(s). (2.8) 


The transform of a time function can be found by use of a table (See Ap- 
pendix B.2); however, typical inputs considered in control system design are 
steps 


R 
u(t) — RJ) => Us)=—, 
ramps 
V 
u(t) = Vtl(t), => U(s)= 
s 
parabolas 
Ar A 
u()- ——10) > U(s) = “8, 
2 S 
and sinusoids 
B 
u(t) — Bsin(ot)l(r), > Uls) = 4— 


s tof 
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Figure 2.1 
An elementary feedback 
system 
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Using Laplace transforms, the output Y (s) from Eq. (2.8) is expanded into its 
elementary terms using partial fraction expansion, then the time function associ- 
ated with each term is found by looking it up in the table. The total time function, 
y(t), is the sum of these terms. In order to do the partial fraction expansion, 
it is necessary to factor the denominator. Typically, only the simplest cases are 
analyzed this way. Usually, system output time histories are solved numerically 
using computer based methods such as MATLAB's step.m for a step input or 
lsim.m for an arbitrary input time history. However, useful information about 
system behavior can be obtained by finding the individual factors without ever 
solving for the time history, a topic to be discussed later. These will be important 
because specifications for a control system are frequently given in terms of these 
time responses. 


The Final Value Theorem 


A key theorem involving the Laplace transform that is often used in control 
system analysis is the final value theorem. It states that, if the system is stable 
and has a final, constant value 


lim x(t) = x,, = lim sX(s). (2.9) 
1—00 = 
The theorem allows us to solve for that final value without solving for the system’s 


entire response. This will be very useful when examining steady-state errors of 
control systems. 


Block Diagrams 


Manipulating block diagrams is useful in the study of feedback control systems. 
The most common and useful result is that the transfer function of the feedback 
system shown in Fig. 2.1 reduces to 


Y(s) |. G(s) 
R(s) 14 H(sG(s) 


4 
ii 9 Es 


(2.10) 
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impulse response 


stability 


time constant 


2.1.6 


Response versus Pole Locations 


Given the transfer function of a linear system, 


the values of s such that a(s) = 0 will be places where H (s) is infinity, and these 
values of s are called poles of H(s). On the other hand, values of s such that 
b(s) = 0 are places where H(s) is zero,and the corresponding s locations are 
called zeros. Since the Laplace transform of an impulse is unity, the impulse 
response is given by the time function corresponding to the transfer function. 
Each pole location in the s-plane can be identified with a particular type of 
response. In other words, the poles identify the classes of signals contained in the 
impulse response, as may be seen by a partial fraction expansion of H (s). For a 
first order pole 


H(s)= 


sto’ 


Table B.2, Entry 8, indicates that the impulse response will be an exponential 
function; that is 


A(t) = e?'l(r). 


When c > 0, the pole is located at s < 0, the exponential decays, and the system 
is said to be stable. Likewise, if o < 0, the pole is to the right of the origin, the 
exponential grows with time and is referred to as unstable. Figure 2.2 shows a 
typical response and the time constant 
l 
T=— (2.11) 
o 
as the time when the response is 1/e times the initial value. 
Complex poles can be described in terms of their real and imaginary parts, 
traditionally referred to as 


s = —0 ijo, 


This means that a pole has a negative real part if ø is positive. Since complex poles 
always come in complex conjugate pairs for real polynomials, the denominator 
corresponding to a complex pair will be 


als) = (s +0 — jw,)(s +a + jo,) =(s+o)?+ 0%. (2.12) 
When finding the transfer function from differential equations, we typically write 


the result in the polynomial form 
2 


w 
H(s) = =————— 2:13 
s 5? -- I 4- e ee 


Figure 2.2 
First-order system 
response 


damping ratio 


Figure 2.3 
s-plane plot for a pair of 
complex poles 
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| Time (sec) 


27 


By expanding the form given by Eq. (2.12) and comparing with the coefficients 
of the denominator of H(s) in Eq. (2.13), we find the correspondence between 


the parameters to be 
o=ty and &,=@/1—¢?, (2.14) 


where the parameter ¢ is called the damping ratio, and w, is called the un- 
damped natural frequency. The poles of this transfer function are located at 
a radius c, in the s-plane and at an angle 0 = sin"! £, as shown in Fig. 2.3. 
Therefore, the damping ratio reflects the level of damping as a fraction of the 
critical damping value where the poles become real. In rectangular coordinates, 
the poles are at s = ~o + jw,. When 4 = 0 we have no damping, 0 = 0, and 
w ,, the damped natural frequency, equals c, , the undamped natural frequency. 


Im(s) 


0 — sin^!Z 


Re(s) 
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step response 


For the purpose of finding the time response corresponding to a complex 
transfer function from Table B.2, it is easiest to manipulate the H (s) so that the 
complex poles fit the form of Eq. (2.12), because then the time response can be 
found directly from the table. The H (s) from Eq. (2.13) can be written as 

He o, 
ET AET 
therefore, from Entry 21 in Table B.2 and the definitions in Eq. (2.14), we see 
that the impulse response is 


h(t) = «,e *' sin(w,t) l(t). 


For œw, = 3 rad/sec and 4 — 0.2, the impulse response time history could be 
obtained and plotted by the MATLAB statements: 


Wn = 3; 

Ze = 0.2; 

s=tf('s}) 

sys = (Wn^2)/s^2*2*Ze*Wn*s*Wn^2) 


impulse(sys); 


It is also interesting to examine the step response of H(s), that is, the 
response of the system H (s) to a unit step input u = 1(f) where U(s) = t. The 
step response transform given by Y(s) = H(s)U(s), contained in the tables in 
Entry 22, is 


y(t)21-e* (eso + Z- sina) (2.15) 
d 


where o, = 9, y/1— g?ando = fw, . This could also be obtained by modifying 
the last line in the MATLAB description above for the impulse response to 


step(sys) 


Figure 2.4 is a plot of y(t) for several values of ¢ plotted with time nor- 
malized to the undamped natural frequency w,. Note that the actual frequency, 
€, decreases slightly as the damping ratio increases. Also note that for very 
low damping the response is oscillatory, while for large damping (¢ near 1) the 
response shows no oscillation. A few step responses are sketched in Fig. 2.5 to 
show the effect of pole locations in the s-plane on the step responses. It is very 
useful for control designers to have the mental image of Fig. 2.5 committed to 
memory so that there is an instant understanding of how changes in pole locations 
influence the time response. The negative real part of the pole, c, determines the 
decay rate of an exponential envelope that multiplies the sinusoid. Note that if o 
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Figure 2.4 

Step responses of 
second-order systems 
versus £ 


y(i) 


Figure 2.5 

Time functions 
associated with points in 
the s-plane 


Im(s) 


LHP RHP 


Re(s) 


x 
i/ | | E3 


is negative, the pole is in the right-half plane, the response will grow with time, 
and the system is said to be unstable. If ø = 0, the response neither grows nor 
decays, so stability is a matter of definition. If o is positive, the natural response 
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2.1.7 


Figure 2.6 

Definition of rise time t,, 
settling time t,, and 
overshoot M, 


decays and the system is said to be stable. Note that, as long as the damping is 
strictly positive, the system will eventually converge to the commanded value. 

All these notions about the correspondence between pole locations and the 
time response pertained to the case of the step response of the system of Eq. (2.13), 
that is, a second-order system with no zeros. If there had been a zero, the effect 
would generally be an increased overshoot; the presence of an additional pole 
would generally cause the response to be slower. If there had been a zero in the 
right-half plane, the overshoot would be repressed and the response would likely 
go initially in the opposite direction to its final value. Nevertheless, the second- 
order system response is useful in guiding the designer during the iterations 
toward the final design, no matter how complex the system is. 


Time-Domain Specifications 


Specifications for a control system design often involve certain requirements 
associated with the time response of the system. The requirements for a step 
response are expressed in terms of the standard quantities illustrated in Fig. 2.6: 


The rise time ;, is the time it takes the system to reach the vicinity of its new 
set point. 


The settling time z, is the time it takes the system transients to decay. 


The overshoot M, is the maximum amount that the system overshoots its 
final value divided by its final value (and often expressed as a percentage). 
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For a second-order system, the time responses of Fig. 2.4 yield information 
about the specifications that is too complex to be remembered unless approxi- 
mated. The commonly used approximations for the second-order case with no 


zeros are 
1.8 
ie (2.16) 
e, 
de AG 
I. [^ qua D, Pg (2.17) 
$0, o 
M,sem s Jaggi (2.18) 


The overshoot M, is plotted in Fig. 2.7. Two frequently used values from this 
curve are M, = 16% for  — 0.5 and M, = 5% for £ = 0.7. 

Equations (2.16)-(2.18) characterize the transient response of a system hav- 
ing no finite zeros and two complex poles with undamped natural frequency w,, 
damping ratio Z, and negative real part c. In analysis and design, they are used 
to obtain a rough estimate of rise time, overshoot, and settling time for just about 
any system. It is important to keep in mind, however, that they are qualitative 
guides and not precise design formulas. They are meant to provide a starting 
point for the design iteration and the time response should always be checked 
after the control design is complete by an exact calculation, usually by numerical 
simulation, to verify whether the time specifications are actually met. If they have 
not been met, another iteration of the design is required. For example, if the rise 


Figure 2.7 

Plot of the peak 
overshoot M, versus the 
damping ratio z for the 
second-order system 


Mp, % 
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2.2.1 


characteristic equation 


Figure 2.8 


A unity feedback system 


time of the system turns out to be longer than the specification, the target natural 
frequency would be increased and the design repeated. 


Basic Properties of Feedback 


An open-loop system described by the transfer function G(s) can be improved 
by the addition of feedback including the dynamic compensation D(s) as shown 
in Fig. 2.8. The feedback can be used to improve the stability, speed up the tran- 
sient response, improve the steady-state error characteristics, provide disturbance 
rejection, and decrease the sensitivity to parameter variations. 


Stability 


The dynamic characteristics of the open-loop system are determined by the poles 
of G(s) and D(s), that is, the roots of the denominators of G(s) and D(s). Using 
Eq. (2.10), we can see that the transfer function of the closed-loop system in 
Fig. 2.8 is 


Y(s D(s)G(s 

ON ic IS) d (2.19) 

R(s) 1 + D(s)G(s) 
sometimes referred to as the complementary sensitivity. In this case, the dy- 
namic characteristics and stability are determined by the poles of the closed-loop 
transfer function, that is, the roots of 


1+ D(s)G(s) = 0. (2.20) 


This equation is called the characteristic equation and is very important in 
feedback control analysis and design. The roots of the characteristic equation 
represent the types of motion that will be exhibited by the feedback system. It 
is clear from Eq. (2.20) that they can be altered at will by the designer via the 
selection of D(s). 


Disturbance, w 


Command 
input 


Measurement noise, v 


system type 


Table 2.1 


2.2 
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Steady-State Errors 


The difference between the command input r (see Fig. 2.8) and the output y is 
called the system error, e. Using Eq. (2.10) for the case where the desired output 
is e, we find that 

E(s) E l 

R(s) 1+ D(s)G(s) 
sometimes referred to as the sensitivity. For the case where r(t) is a step input 
and the system is stable, the Final Value Theorem tells us that 


= S(s), (2.21) 


where 
K = lim D(s)G(s) 
P 5-0 


and is called the position-error constant. If D(s)G(s) has a denominator that 
does not have s as a factor, K, and e.. are finite. This kind of system is referred 
to as type 0. 

These results can also be seen qualitatively by examining Fig. 2.8. In order 
for y to be at some desired value (= r), the higher the forward loop gain of 
DG (defined to be K) the lower the value of the error, e. An integrator has the 
property that a zero steady input can produce a finite output, thus producing an 
infinite gain. Therefore, if there is an integrator in D or G, the steady-state gain 
will be co and the error will be zero. 

Continuing, we define the velocity constant as 


K,= lim s D(s)G(s) 
and the acceleration constant as 

K, = lim s* D(s)G(s). 
When K, is finite, we call the system type 1; likewise, when K, is finite, we 
call the system ty pe 2. For the unity feedback case, it is convenient to categorize 


the error characteristics for command inputs consisting of steps, ramps, and 
parabolas. Table 2.1 summarizes the results. 


Errors versus system type for unity feedback 


Step | Ramp Parabola 


Type 0 TE) co oo 
Type 1 0 = co 
Type 2 0 0 x 
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2.2.3 


2.3 


System type can also be defined with respect to the disturbance inputs w. 
The same ideas hold, but in this case the type is determined by the number of 
integrators in D(s) only. Thus, if a system had a disturbance as shown in Fig. 2.8 
which was constant, the steady-state error e, of the system would only be zero 
if D(s) contained an integrator. 


PID Control 


Proportional, integral, and derivative (PID) control contains three terms. They 
are proportional control 


u(t) = Kett) => D(s)=K, (2.22) 
integral control 
u(t) = a endn = D(s)= m (2.23) 
and derivative control 
u(t) - KT,e(t) => D(s)=KT,s. (2.24) 


T, is called the integral (or reset) time, 7,, the derivative time, and K the position 
feedback gain. Thus, the combined transfer function is 

D(s) = us) =K(1+ 4 +T,s). (2.25) 

e(s) is 

Proportional feedback control can lead to reduced errors to disturbances but 
still has a small steady-state error. It can also increase the speed of response 
but typically at the cost of a larger transient overshoot. If the controller also 
includes a term proportional to the integral of the error, the error to a step can be 
eliminated as we saw in the previous section. However, there tends to be a further 
deterioration of the dynamic response. Finally, addition of a term proportional 
to the error derivative can add damping to the dynamic response. These three 
terms combined form the classical PID controller. It is widely used in the process 
industries and commercial controller hardware can be purchased where the user 
only need “tune” the gains on the three terms. 


Root Locus 


The root locus is a technique which shows how changes in the system’s open-loop 
characteristics influence the closed-loop dynamic characteristics. This technique 
allows us to plot the locus of the closed-loop roots in the s-plane as an open-loop 
parameter varies, thus producing a root locus. The root locus method is most 
commonly used to study the effect of the loop gain (K in Eq. (2.25)); however, 
the method is general and can be used to study the effect of any parameter in D(s) 


2.3.1 


root locus definition 
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or G(s). In fact, the method can be used to study the roots of any polynomial 
versus parameters in that polynomial. 

A key attribute of the technique is that it allows you to study the closed-loop 
roots while only knowing the factors (poles and zeros) of the open-loop system. 


Problem Definition 


The first step in creating a root locus is to put the polynomials in the root locus 
form 


1 + K— =0. (2.26) 


Typically, Kb(s)/a(s) is the open loop transfer function D(s)G(s) of a feedback 
system; however, this need not be the case. The root locus is the set of values 
of s for which Eq. (2.26) holds for some real value of K. For the typical case, 
Eq. (2.26) represents the characteristic equation of the closed-loop system. 

The purpose of the root locus is to show in a graphical form the general trend 
of the roots of a closed-loop system as we vary some parameter. Being able to do 
this by hand (1) gives the designer the ability to design simple systems without a 
computer, (2) helps the designer verify and understand computer-generated root 
loci, and (3) gives insight to the design process. 

Equation (2.26) shows that, if K is real and positive, b(s)/a(s) must be real 
and negative. In other words, if we arrange b(s)/a(s) in polar form as magnitude 
and phase, then the phase of b(s)/a(s) must be 180°. We thus define the root 
locus in terms of the phase condition as follows. 


180° locus definition: The root locus of b(s)/a(s) is the set of points in the 
s-plane where the phase of b(s)/a(s) is 180°. 


Since the phase is unchanged if an integral multiple of 360° is added, we 
can express the definition as? 


ZB) = 180° + 360°, 
a(s) 


where / is any integer. The significance of the definition is that, while it is very 
difficult to solve a high-order polynomial, computation of phase is relatively 
easy. When K is positive, we call this the positive or 180° locus. When K is 
real and negative, b(s)/a(s) must be real and positive for s to be on the locus. 
Therefore, the phase of b(s) /a(s) must be 0°. This case is called the 0° or negative 
locus. 


2 £ refers to the phase of ( ). 
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2.3.2 Root Locus Drawing Rules 


The steps in drawing a 180° root locus follow from the basic phase definition. 
They are 


STEP | On the s-plane, mark poles (roots of &(5)) by an x and zeros (roots of 
a(s)) by a o. There will be a branch of the locus departing from every pole and a 
branch arriving at every zero. 


STEP2 Draw the locus on the real axis to the left of an odd number of real 
poles plus zeros. 


STEP3 Draw the asymptotes, centered at œ and leaving at angles $,, where 


n — m = number of asymptotes 
n = order of a(s) 


m = order of b(s) 
E LP Sato. 
a n-m n-m ` 
180° + (| — 1)360° 


= 121,2. n-m. 


n-m 
For n — m > 0, there will be a branch of the locus approaching each asymptote 
and departing to infinity. For n — m <0, there will be a branch of the locus 
arriving from infinity along each asymptote. 


STEP4 Compute locus departure angles from the poles and arrival angles at 
the zeros where 


Iba = » , V, — 9 4, — 180^ — 1360° 
IVar = 30, — Y v, + 180° + 1360? 


where q is the order of the pole or zero and / takes on q integer values so that the 
angles are between +180°. y; is the angle of the line going from the i, pole to 
the pole or zero whose angle of departure or arrival is being computed. Similarly, 
$, is the angle of the line from the i,, zero. 


STEP 5 If further refinement is required at the stability boundary, assume s, = 
jæ, and compute the point(s) where the locus crosses the imaginary axis for 
positive K. 


STEP6 For the case of multiple roots, two loci come together at 180° and break 
away at +90°. Three loci segments approach each other at angles of 120° and 
depart at angles rotated by 60°. 


STEP? Complete the locus, using the facts developed in the previous steps and 
making reference to the illustrative loci for guidance. The loci branches start at 
poles and end at zeros or infinity. 
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STEP 8 Select the desired point on the locus that meets the specifications (s,), 
then use the magnitude condition from Eq. (2.26) to find that the value of K 
associated with that point is 


E 
[b(s,)/a(s,)| 
When K is negative, the definition of the root locus in terms of the phase 
relationship is 


O° locus definition: The root locus of b(s)/a(s) is the set of points in the s-plane 
where the phase of b(s)/a(s) is 0°. 


For this case, the Steps above are modified as follows 
STEP 2 Draw the locus on the real axis to the left of an even number of real 
poles plus zeros. 
STEP 3 The asymptotes depart at 


l — 1)360° 
di Ee TE E 
n—m 


STEP 4 The locus departure and arrival angles are modified to 


dhap = 9 V, - 9,6, — 1360" 
Wan = 9 4; — 9 V, + 360. 


Note that the 180? term has been removed. 


€ Example 2.1 


Root Locus Shetch 


Sketch the root locus versus K (positive and negative) for the case where the open-loop system 
is given by 


s 
G(s) = K=—.. 
i s +l 


Solution. First let’s do the 180° locus. 


STEP 1: There is a zero at s = 0 and poles ats = +jw. 
STEP 2: There is a locus on the entire negative real axis. 


STEP 3: n — m = l, therefore, there is one asymptote and it departs at 180°; that is, along 
the negative real axis. 


STEP 4: The departure angle at the upper pole is calculated as 
$, = 90° — 90° — 180° = —180°, 
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Figure 2.9 


Example root locus 


sketch 


Imag Axis 
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thus, the locus departs from the upper pole horizontally to the left. The departure angle from 
the lower pole also turns out to be — 180? and that branch of the locus also departs horizontally 
to the left. 


We know that there is a locus segment along the entire negative real axis; however, we 
also know that there is a locus branch moving to the right and arriving at the zero, and that 
there is a branch departing to infinity at the far left. Therefore, the two branches from the poles 
must join the real axis at some point and split in opposite directions. It turns out that the two 
complex branches form a semi-circle as they approach the real axis. The solid lines in Fig. 2.9 
show the sketch of this 180? locus. 

For the 0° locus, there is a segment along the positive real axis and the angles of departure 
are both 0°. The result is shown in the figure by the dotted lines. 


* 


2.3.3 


Computer-Aided Loci 


The most common approach to machine computation of the root locus is to cast 
the problem as a polynomial in the form a(s) + Kb(s) = 0, and, for a sequence 
of values of K varying from near zero to a large value, solve the polynomial for 
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its n roots by any of many available numerical methods. A disadvantage of this 
method is that the resulting root locations are very unevenly distributed in the 
s-plane. For example, near a point of multiple roots, the sensitivity of the root 
locations to the parameter value is very great, and the roots just fly through such 
points, the plots appear to be irregular, and sometimes important features are 
missed. As a result, it is useful to have the root locus plotting rules in mind when 
interpreting computer plots. The polynomial is generally solved by transforming 
the problem to state-variable form and using the OR algorithm which solves for 
the eigenvalues of the closed-loop system matrix. 


$ Example 2.2 


CACSD Root Locus 
1. Plot the root locus using MATLAB for the open-loop system shown in Fig. 2.8 with 
10 STS 
G(s) = : d D(s)= K : 
Oey dm ASS ag 


2. Find the gain K associated with the point of maximum damping of the complex roots for 
the given D(s) and plot the step response with that value of K. 


3. Reconcile the root locus plot with the hand plotting rules and compare the computer-based 
step response with the rules of thumb in Eqs. (2.16)-(2.18). 


Solution. 


1. The MATLAB script following will generate the desired locus plot which is shown in 
Fig. 2.10(a). 


s=tf('s'); 
sysG=10/(s*%2+2*s); 
sysD = (s+3)/(s+10); 
sys=sysG*sysD; 
rlocus(sys) 

2. The statement 
[K,p] = rlocfind(sys) 


will place a cross-hair on the plot which can be moved with the mouse to the desired point 
on the locus in order to find the associated gain K and pole locations p. Given this value 
of K, (= 3.7) the script 


K= 37 
sysCL = feedback(K* sys, 1) 
step(sysCL) 


produces the desired step response shown in Fig. 2.10(b). 
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Figure 2.10 

Example of CACSD for 
(a) root locus and (b) 
step response 
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. The root locus in Fig. 2.10(a) has locus segments to the left of odd numbers of poles and 


zeros (Step 2); has two asymptotes departing at 3-90? and centered at 
—24+3-10 

— eir s 

(Step 3); and has branches departing the real axis at the multiple roots between the two 


poles at +90° (Step 6). The gain associated with the desired root at s = —3.5 + j3.1 can 
be determined from Step 8 by 


= —4.5, 


(4.7)(3.5)(7.2) _ 


K= =T 
(3.2)(10) 
where 4.7 is the distance from the root to the pole at s = 0; 3.5 is the distance to the pole 
at s = —2; 7.2 is the distance to the pole at s = —10; 3.2 is the distance to the zero at 


s = —3; and 10 is from the gain of G(s). 

The step response shown in Fig. 2.10(b) has ¢, = 0.4 sec, t, = 1.4 sec, and M, z= 
10%. The closed-loop roots with K = 3.7 are at s = —5.1, —3.5 + j3.1; thus, for the 
complex roots, ¢ = 0.4, w, = 4.7 rad/sec, and ø = 3.5. The predicted values given in 
Section2. 1. 7 suggest that 


1.8 
— 0.38 sec 


= 1.3 sec 


2 
M, e "UN 1-7 — 4% (Fig. 2.7) 
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The predicted values based on the second order system with no zeros predict a t, and t, 
that are a little faster than the plot due to the presence of the extra pole. The predicted 
Mp of 4% is much smaller than the 10% shown on Fig. 2.10(b), also a result of the extra zero. 
If a lower overshoot was required, the zero would need to be moved to the right as needed. 


* 


2.4 Frequency Response Design 
The response of a linear system to a sinusoidal inputis referred to as the system's 
frequency response. A system described by 
Y (s) 
Gesn — (20). 
UG) (s) 
where the input u(t) is a sine wave with an amplitude of U, and frequency w 
u(t) = U, sin at, 


which has a Laplace transform 


U( ) Uo, 
s) = >, 
s+ e 
has a response with the transform, 
U w 
Y(s) = G(s) =". 2.27 
(s) ( 3 iem (2.27) 


A partial fraction expansion of Eq. (2.27) will result in terms that represent the 
natural behavior of G(s) and terms representing the sinusoidal input. Providing 
that all the natural behavior is stable, those terms will die out and the only terms 
left in the steady state are those due to the sinusoidal excitation, that is 


* 


a a 
ee -— (2.28) 
o SHj@, S— jw, 


where @, and o7 would be found by performing the partial fraction expansion. 
After the natural transients have died out, the time response is 


y(t) = 2|a,| sin(w t +) = U,A sin(@w,t + 9) 


where 


A-|GGe)l 2I1GG)| — . (2.29) 


S=jo, 


_, In[G(jo,)] 


à zone ; 
$ = tan Re[G Go) Z G(jo) (2.30) 


31 
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bandwidth 


2.4.1 


So, a stable linear system G(s) excited by a sinusoid will eventually exhibit a 
sinusoidal output y with the same frequency as the input u. The magnitude, A (c) 
of y with respect to the input, = |G(j@,)| and the phase, ¢(w, ), is LG(jw,); that 
is, the magnitude and phase of G(s) is evaluated by letting s take on values along 
the imaginary (jw) axis. In addition to the response to a sinusoid, the analysis of 
the frequency response of a system is very useful in the determination of stability 
of a closed-loop system given its open-loop transfer function. 

A key reason that the frequency response is so valuable is that the designer 
can determine the frequency response experimentally with no prior knowledge of 
the system’s model or transfer function. The system is excited by a sinusoid with 
varying frequency and the magnitude A(w) is obtained by a measurement of the 
ratio of the output sinusoid to input sinusoid in the steady-state at each frequency. 
The phase (w) is the measured difference in phase between input and output 
signals. As an example, frequency responses of the second-order system 


l 
~ (s/@,)? + 26(s/w,) +1 


are plotted for various values of ¢ in Fig. 2.11 which is done by MATLAB with 
bode(sys). 


G(s) 


Specifications 


A natural specification for system performance in terms of frequency response 
is the bandwidth, defined to be the maximum frequency at which the output of 
a system will track an input sinusoid in a satisfactory manner. By convention, 
for the system shown in Fig. 2.12 with a sinusoidal input r, the bandwidth is the 
frequency of r at which the output y is attenuated to a factor of 0.707 times the 
input (or down 3 dB). Figure 2.13 depicts the idea graphically for the frequency 
response of the closed-loop transfer function (defined to be 7 (s) in Eq. (2.19)) 


Y) x KOO 
RG) ©) — C XGG 


The plot is typical of most closed-loop systems in that 1) the output follows the 
input, |7 | = 1, at the lower excitation frequencies, and 2) the output ceases to 
follow the input, |7| < 1, at the higher excitation frequencies. 

The bandwidth w, is a measure of the speed of response and is therefore 
similar to the time-domain measure of rise time t, or the s-plane measure of 
natural frequency c,. In fact, it can be seen from Fig. 2.11 that the bandwidth 
will be equal to the natural frequency when Z = 0.7. For other damping ratios, the 
bandwidth is approximately equal to the natural frequency with an error typically 
less than a factor of 2. 

The resonant peak M, is a measure of the damping, as evidenced by Fig. 2.11 


where the peak is approximately the value at w = c, , which is 23 for ġ < 0.5. 
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Figure 2.11 

(a) Magnitude and (b) 
phase of a second-order 
system 


3 
2 
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Figure 2.12 
Simplified system 
definition 


Figure 2.13 
Definitions of bandwidth 
and resonant peak 
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Bode Plot Techniques 


It is useful to be able to plot the frequency response of a system by hand in order 
to (a) design simple systems without the aid of a computer, (b) check computer- 
based results, and (c) understand the effect of compensation changes in design 
iterations. H. W. Bode developed plotting techniques in the 1930s that enabled 
quick hand plotting of the frequency response. His rules are: 


STEP] Manipulate the transfer function into the Bode form 


jwt, + D)jot, +1)--- 
KG(jo) = K (uy POF DUO FO 
(jot, + Dot, +1)--- 
STEP 2 Determine the value of n for the K,(jw)" term. Plot the low-frequency 
magnitude asymptote through the point K, at w = 1 rad/sec with a slope of n (or 
nx 20 GB per decade). 


STEP 3 Determine the break points where w = | /t,. Complete the composite 
magnitude asymptotes by extending the low frequency asymptote until the first 
frequency break point, then stepping the slope by +1 or +2, depending on 
whether the break point is from a first or second order term in the numerator or 
denominator, and continuing through all break points in ascending order. 


STEP 4 Sketch in the approximate magnitude curve by increasing from the 
asymptote by a factorof 1.4 (+3 dB) at first order numerator breaks and decreasing 
it by a factor of 0.707 (—3 dB) at first order denominator breaks. At second order 
break points, sketch in the resonant peak (or valley) according to Fig. 2.1 1(a) 
using the relation that |G(jw)| = 1/(2¢) at the break. 


2.4.3 


Figure 2.14 
Phase plot for jer + 1; 
t—0.1 
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STEP 5 Plot the low frequency asymptote of the phase curve, @ = n x 90°. 


STEP6 As a guide, sketch in the approximate phase curve by changing the 
phase gradually over two decades by +90° or +180° at each break point in 
ascending order. For first order terms in the numerator, the gradual change of 
phase is +90°; in the denominator, the change is —90°. For second order terms, 
the change is +180°. 


STEP 7 Locate the asymptotes for each individual phase curve so that their 
phase change corresponds to the steps in the phase from the approximate curve 
indicated by Step 6. Sketch in each individual phase curve as indicated by Fig. 2.14 
or Fig. 2.11(b). 


STEP 8 Graphically add each phase curve. Use dividers if an accuracy of about 
+5° is desired. If lessor accuracy is acceptable, the composite curve can be done 
by eye, keeping in mind thatthe curve will start at the lowest frequency asymptote 
and end on the highest frequency asymptote, and will approach the intermediate 
asymptotes to an extent that is determined by the proximity of the break points 
to each other. 


Steady-State Errors 


Section 2.2.2 showed that the steady-state error of a feedback system decreases 
as the gain of the open loop transfer function increases. In plotting a composite 
magnitude curve, the low frequency asymptote is given by 


KG(jo) = K (jo. (2.31) 


Therefore, we see that the larger the value of the magnitude on the low-frequency 
asymptote, the lower the steady-state errors will be for the closed loop system. 
This idea is very useful in designing compensation. 


Asymptote 


Break point p 


Z(jol0 + 1) 


-30° —— 
0.01 0.02 01 02 04 l 


w (rad/sec) 
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For a system with n = 0, (a type 0 system) the low frequency asymptote is 
a constant and the gain K, of the open loop system is equal to the position error 
constant, K . For a system where n = —1, (a type 1 system) the low frequency 
asymptote has a slope of —1 and K, = K, 

The easiest way of determining the value of K, in a type 1 system is to read 
the magnitude of the low frequency asymptote at a frequency low enough to be 
well below the any of the break points because zs equals the magnitude at these 
frequencies. In some cases, the lowest frequency break point will be below w = 1 
rad/sec, therefore the asymptote can be extended to w = | rad/sec in order to read 
K, directly. 


Stability Margins 


If the closed-loop transfer function of a system is known, the stability of the 
system can be determined by simply inspecting the denominator in factored 
form to observe whether the real parts are positive or negative. However, the 
closed-loop transfer function is not usually known; therefore, we would like 
to determine closed-loop stability by evaluating the frequency response of the 
open-loop transfer function K G(jq@) and then performing a simple test on that 
response. This can be done without a math model of the system by experimentally 
determining the open-loop frequency response. 

We saw in Section 2.3.1 that all points on the root locus have the property 
that 


|KG(s)| 21 and Z(KG(s)) = 180°. 


At the point of neutral stability we see that these root-locus conditions hold for 
5 = jo, so 


|KG(jo) 21 and Z(KG(ja)) = 180°. (2.32) 


Thus a Bode plot of a system that is neutrally stable (that is, with the value of 
K such that the closed-loop roots fall on the imaginary axis) will satisfy the 
conditions of Eq. (2.32). That means that the magnitude plot must equal | at the 
same frequency that the phase plot equals 180°. Typically, a system becomes less 
stable as the gain increases; therefore, we have the condition for stability 


IKG(je)) <1 at Z(KG(jo)) = —180*. (2.33) 


This stability criterion holds for all systems where increasing gain leads to insta- 
bility and |K G(j &)| crosses the magnitude = 1 once, the most common situation. 
However, there are systems where an increasing gain can lead from instability to 
stability and in this case, the stability condition is 


IKG(jo) »1 at Z(KG(jo) = —180. 


gain margin 
phase margin 


2.4.5 


crossover frequency 
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One way that will frequently resolve the ambiguity is to perform a rough sketch of 
the root locus to resolve the question of whether increasing gain leads to stability 
or instability. The rigorous way to resolve the ambiguity is to use the Nyquist 
stability criterion, which is reviewed in Section 7.5.1 for continuous systems. 

Two quantities that measure the stability margin of a system are directly 
related to the stability criterion of Eq. (2.33): gain margin and phase margin. 
The gain margin (GM) is the factor by which the gain is less than the neutral 
stability value when the phase = 180°. The phase margin (PM) is the amount by 
which the phase of G(s) exceeds —180^ when |KG(jw)| = 1. The two margins 
are alternate ways of measuring the degree to which the stability conditions of 
Eq. (2.33) are met. 

The phase margin is generally related to the damping of a system. For a 
second-order system, the approximation that 


~ PM 
~ 100 


is commonly used. Therefore, if it were known that a system was to be designed 
using frequency response methods, it would make sense to specify the speed of 
response of the system in terms of a required bandwidth and the stability of the 
system in terms of a required phase margin. 


i4 


Bode's Gain-Phase Relationship 


One of Bode's important contributions is his theorem that states 


For any minimum phase system (that is, one with no time delays, RHP zeros or 
poles), the phase of G (jw) is uniquely related to the integral of the magnitude 
of G(jw). 


When the slope of |G(jw)| versus w on a log-log scale persists at a constant 
value for nearly a decade of frequency, the relationship is particularly simple 


ZG(jw) =n x 90°, (2.34) 


where n is the slope of |G(jw)| in units of decade of amplitude per decade of 
frequency. 

Equation (2.34) is used as a guide to infer stability from |G(jw)| alone. 
When |K G(jo)| = 1, the crossover frequency, the phase 


ZG(jo) 
ZG(jo) 


For stability we want 2 G(jw) > —180° for a PM > 0. Therefore we adjust the 
|KG(jw)| curve so that it has a slope of — 1 at the crossover frequency. If the 
slope is —1 for a decade above and below the crossover frequency, the PM would 
be approximately 90°; however, to ensure a reasonable PM, it is usually only 
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necessary to insist on a — | slope (—20 dB per decade) persisting for a decade in 
frequency that is centered at the crossover frequency. 


Design 


One of the very useful aspects of frequency-response design is the ease with 
which we can evaluate the effects of gain changes. In fact, we can determine the 
PM for any value of K without redrawing the magnitude or phase information. 
We need only indicate on the figure where | K G(jw)|= 1 for selected trial values 
of K since varying K has the effect of sliding the magnitude plot up or down. 


€ Example 2.3 


Frequency-Response Design 


For a plant given by 
l 


s(s + 1)?’ 
determine the PM and GM for the system with unity feedback and (a) K = 1, (b) determine if 


the system is stable if K = 5, and (c) find what value of K is required to achieve a PM of (i) 
45°, and (ii) 70°. 


G(s) = K 


Solution. 
Using the hand plotting rules, we see that the low frequency asymptote has a slope of —1 
and goes thru magnitude = | at w = | rad/sec. The slope changes to —3 at the break point 


(w = 1). We can then sketch in the actual magnitude curve, noting (STEP 4 in Section 2.4.2) 

that it will go below the asymptote intersection by —6 dB because there is a slope change of 

—2 at that break point. The curve is sketched in Fig. 2.15. The phase curve starts out at —90° 

and drops to —270° along the asymptote as sketched in the figure according to STEP 7. 
Using MATLAB, the statements 


num=1, den={1 2 1 0 
sys = tf(num,den) 
bode(sys). 


will also create the plots of magnitude and phase for this example. The curves are drawn in 
Fig. 2.15 showing the PM and GM for K = I and the same curves are drawn in Fig. 2.16 
showing the PM's for K = 5, 0.5, & 0.2. 


(a) We can read the PM from Fig. 2.15 to be 22°. 
(b) Fig. 2.16 shows that the system is unstable for K=5. 
(c) (i) PM = 45? when K = 0.5, and (ii) PM = 70? when K = 02 


Figure 2.15 
Magnitude and phase 
plots with PM and GM 
for 1/s(s + 1) 
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Compensation 


If the plant dynamics are of such a nature that a satisfactory design cannot be 
achieved by adjustment of the feedback gain alone, then some modification or 
compensation must be made in the feedback to achieve the desired specifications. 
Typically, it takes the form 


S+2Z 


D(s) = K 
(s) E 


where it is called lead compensation if z « p and lag compensation if z > p. 
Lead compensation approximates the addition of a derivative control term and 
tends to increase the bandwidth and the speed of response while decreasing the 
overshoot. Lag compensation approximates integral control and tends to improve 
the steady-state error. 
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Figure 2.16 
PM versus K for 
1s +1 
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The design of lead compensation typically entails placing the zero z at a 
frequency that is lower than the magnitude = | crossover frequency and the pole 
higher than the crossover frequency. Lead compensation provides an increased 
magnitude slope and an increased phase in the interval between these two break 
points; the maximum being halfwaybetween the two break points on a logarithmic 
scale. The maximum phase increase is 


6$ = sin! ira. where o = ze 
lcto 
and is plotted versus o in Fig. 2.17. 
The design of lag compensation typically entails placing both break points 
well below the crossover frequency. Lag compensation decreases the phase in the 
vicinity of the two break points; therefore, z should be well below the crossover 


Figure 2.17 
Maximum phase 
increase for lead 
compensation 
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estimator 
observer 
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Maximum phase lead 


frequency in order to prevent the compensation from degrading the PM and the 
system stability. The primary role of lag compensation is to increase the gain 
(magnitude of the frequency response) at the low frequencies. As we saw in 
Section 2.4.3, this will decrease the steady-state error. 


State-Space Design 


We saw in Section 2.1.] that equations of motion could be written in the state- 
variable form of Eqs. (2.1) and (2.2). The state-space design approach utilizes 
this way of describing the plant and arrives directly with feedback controllers 
(compensation) without the need to determine transforms. Advantages of state- 
space design are especially apparent when the system to be controlled has more 
than one control input or more than one sensed output, called multivariable 
or multi input-multi output (MIMO). However, we will review only the single 
input-single output (SISO) case here. For readers not familiar with state-space 
design, the material in this section is not required for comprehension of the 
remainder of the book. The basic ideas of state-space design are covered in detail 
in Chapter 8 for the discrete case and that chapter does not require that the reader 
be knowledgeable about continuous state-space design. Chapter 9 extends state- 
space design for discrete systems to optimal control design for the multivariable 
case. 

One of the attractive features of the state-space design method is that it 
consists of a sequence of independent steps. The first step, discussed in Section 
2.6.1, is to determine the control. The purpose of the control law is to allow us to 
design a set of pole locations for the closed-loop system that will correspond to 
satisfactory dynamic response in terms of rise-time, overshoot, or other measures 
of transient response. 

The second step—necessary if the full state is not available—is to design an 
estimator (sometimes called an observer), which computes an estimate of the 
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pole placement 


Figure 2.18 
Schematic diagram of 
state-space design 
elements 


entire state vector when provided with the measurements of the system indicated 
by Eq. (2.2). We review estimator design in Section 2.6.2. 

The third step consists of combining the control law and the estimator. 
Figure 2.18 shows how the control law and the estimator fit together and how 
the combination takes the place of what we have been previously referring to as 
compensation. 

The fourth and final step is to introduce the reference input in such a way 
that the plant output will track external commands with acceptable rise-time, 
overshoot and settling time values. Figure 2.18 shows the command input r 
introduced in the same relative position as was done with the transform design 
methods; however, in Section 2.6.4 we will show how to introduce the reference 
input in a different way that results in a better system response. 


Control Law 


The first step is to find the control law as feedback of a linear combination of all 
the state variables—that is, 


xi 


X 
P [Eee |. (2.35) 


x, 


We assume for design purposes that all the elements of the state vector are at our 
disposal, an infrequent situation for an actual system, but an expedient assumption 
for the time being. 

For an nth-order system there will be n feedback gains K,, ... , K,, and 
since there are n roots (or poles) of the system, it is possible that there are enough 
degrees of freedom to select arbitrarily any desired root location by choosing the 


proper values of K,. 


i 
l 
Matrix of State vector i 
constants estimate | 


Compensation 


controllability 
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Substituting the feedback law, Eq. (2.35) into the system described by 
Eq. (2.1) yields 


x = Fx — GKx. (2.36) 
The characteristic equation of this closed-loop system is 
det[sI — (F — GK)] = 0. (2.37) 


When evaluated, this yields an nth-order polynomial in s containing the gains 
K,,..., K,. The control-law design then consists of picking the gains K so that 
the roots of Eq. (2.37) are in desirable locations. Selection of desirable locations 
for the roots is an inexact science and may require some iteration by the designer. 
For now, we will assume that the desired locations are known, say 


HH sare ey 


Then the corresponding desired (control) characteristic equation is 
a (s) = (s —s,)(s — 33) = (Sg, = 0. (2.38) 


Hence the required elements of K are obtained by matching coefficients in 
Eq. (2.37) and Eq. (2.38). This forces the system characteristic equation to be 
identical with the desired characteristic equation and the closed-loop poles to be 
placed at the desired locations. 

The calculation of K can be done providing the system is controllable. 
Systems that are not controllable have certain modes or subsystems that are unaf- 
fected by the control. This usually means that parts of the system are physically 
disconnected from the input. Although there is a mathematical test for controlla- 
bility, it is good practice to insist on the stronger condition that the control input 
be as strongly coupled to the modes of interest as possible. 

It is theoretically possible to solve for K by hand with Eq. (2.37) and 
Eq. (2.38). In practice, this is almost never done. Ackermann's formula for this 
calculation has been implemented in MATLAB as the function acker.m and can 
be used for the design of SISO systems with a small (< 10) number of state vari- 
ables. For more complex cases a more reliable formula is available, implemented 
in MATLAB as the function place. A modest limitation on place.m is that none 
of the desired closed-loop poles are repeated; i.e., that the poles are distinct, a 
requirement that does not apply to acker. Both acker and place require inputs 
consisting of the system description matrices, F and G, and a vector, p, of n 
desired pole locations. Their output is the feedback gain K. Thus the MATLAB 
statements 


K =acker(F,G,p) or K = place(F, G, P) 


will provide the desired value of K. When selecting the desired root locations, it 
is always useful to keep in mind that the control effort required is related to how 
far the open-loop poles are moved by the feedback. Furthermore, when a zero is 
near a pole, the system may be nearly uncontrollable and moving such a pole may 
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LQR 


optimal control 


require large control effort. Therefore, a pole placement philosophy that aims to 
fix only the undesirable aspects of the open-loop response and avoids either large 
increases in bandwidth or efforts to move poles that are near zeros will typically 
allow smaller gains and thus smaller control actuators. 

A method called the linear quadratic regulator (LQR) specifically ad- 
dresses the issue of achieving a balance between good system response and 
the control effort required. The method consists of calculating the gain K that 
minimizes a cost function 


f= Í [x Qx + u" Ru] dt (2.39) 
0 


where Q is an n x n state weighting matrix, R is an m x m control weighting 
matrix, and m is the number of control inputs in a multi-input system. For the 
SISO systems that we are primarily concerned with here, m = | and Ris a scalar 
R. The weights Q and R are picked by the designer by trial-and-error in order to 
arrive at the desired balance between state errors x "x and control usage u°, thus 
avoiding the necessity of picking desired pole locations that do not use excessive 
control. Generally, Q is a diagonal matrix with a weighting factor on one or 
more of the state-vector elements while R = 1. Jt is perfectly acceptable to only 
weight one element, in fact, the element representing the system output is often 
the only element weighted. Rules of thumb that help in picking the weights are 
that (1) the bandwidth of the system increases as overall values in Q increase, 
(2) the damping increases as the term in Q that weights the velocity type state 
elements increase, and (3) a portion of a system can be made faster by increasing 
the weights on the state elements representing that portion of the system. The 
MATLAB statement 


K = lar(F,G,Q,R) 


solves for the K that minimizes the cost, J. 


€ Example 2.4 


State-Space Control Design 


For a plant given by 


l 
G(s) = =, 
(s) 3 
(a) Find the feedback gain matrix K that yields closed-loop roots with w, = 3 rad/sec and Z 


- 0.8. 
(b) Investigate the roots obtained by using LOR with 
| 100 3 
Or sS 


-[; 4 100 0 
g= 0 oj’ p. ale 


and R=]. 
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Solution. The state-variable description of G(s) is (Eq. (2.4) with œw, — 0, ¢ = 0, and 
K =!) 
lio} 87 [i] 
F- , = 
0 0 1 
H-[1 0], J=0. 


(a) The desired characteristic equation is 
a (s) = st 250,5 T w? zx) 


Therefore, the MATLAB script 


F = [0 1;0 0] 
G = [0;1] 
Wn = 3 
Ze = 0.8 


p =roots([1 2*Wn*Ze Wr2)) 
K = acker(F,G,p) 
provides the answer 
K-[9 4.8). 
(b) The scripts 
Q - [1 0;0 O], [100 0;0 0], and [100 0;0 5] 


R=1 
K = Iqr(EG,Q,R) 
p = eig(F - G*K) 


[Wn,Ze] = damp(p) 
compute feedback gains of 
K=([1 1.4], [10 45] and [10 5]. 
which produces natural frequencies of 
w,=1, 32, and 3.2 rad/sec 
and damping of 
£ —0.71, 0.7], and 0.79. 


For this simple example, use of acker is the easier way to find K; however, in more 
complex systems with higher order roots, it is easier to use lqr rather than iterate on the 
best value for all the roots. 


* 


46  Chapter2 Review of Continuous Control 


2.6.2 Estimator Design 


For a system described by Eqs. (2.1) and (2.2), an estimate, X, of the full state 
vector, x, can be obtained based on measurements of the output, y, from 


È = FR + Gu + L(y — Hi). (2.40) 
Here L is a proportional gain defined as 
B= ih esses ^ (2.41) 


and is chosen to achieve satisfactory error characteristics. The dynamics of the er- 
ror can be obtained by subtracting the estimate (Eq. 2.40) from the state (Eq. 2.1), 
to get the error equation 


X = (F — LH)X. (2.42) 
The characteristic equation of the error is now given by 
det[sI — (F — LH)] = 0. (2.43) 


We choose L so that F-LH has stable and reasonably fast eigenvalues, so € decays 
to zero, independent of the control u(r) and the initial conditions. This means 
that X(t) will converge to x(t). 

Errors in the model of the plant (F, G, H) cause additional errors to the state 
estimate from those predicted by Eq. (2.42). However, L can typically be chosen 
so that the error is kept acceptably small. It is important to emphasize that the 
nature of the plant and the estimator are quite different. The plant is a physical 
system such as a chemical process or servomechanism whereas the estimator is 
usually an electronic unit computing the estimated state according to Eq. (2.40). 

The selection of L is approached in exactly the same fashion as K is selected 
in the control-law design. If we specify the desired location of the estimator error 


poles as 
s, = B. By. B, 
then the desired estimator characteristic equation is 
o,(5) = (s — B,)(s — B) (s — &,. (2.44) 


We can solve for L by comparing coefficients in Eq. (2.43) and Eq. (2.44). 

As in the control case, this is almost never done by hand. Rather, the func- 
tions acker.m and place.m are used, but with a slight twist. The transpose of 
Eq. (2.43) is 


det[sI — (F* — H™L™)] = 0, (2.45) 


and we now see that this is identical in form to Eq. (2.37) where K and L” play 
the same role. Therefore, we compute L to achieve estimator poles at the desired 
location, p, by typing in MATLAB 


L = acker(F, H, p). or L-place(F, H’, p)' 


observability 


optimal estimation 
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where F” is indicated in MATLAB as F', etc. 

There will be a unique solution for L for a SISO system provided that the 
system is observable. Roughly speaking, observability refers to our ability to 
deduce information about all the modes of the system by monitoring only the 
sensed outputs. Unobservability results when some mode or subsystem has no 
effect on the output. 

The selection of the estimator poles that determine L are generally chosen 
to be a factor of 2 to 6 faster than the controller poles. This ensures a faster 
decay of the estimator errors compared with the desired dynamics, thus causing 
the controller poles to dominate the total system response. If sensor noise is 
particularly large, it sometimes makes sense for the estimator poles to be slower 
than two times the controller poles, which would yield a system with lower 
bandwidth and more noise smoothing. On the other hand, the penalty in making 
the estimator poles too fast is that the system becomes more noise sensitive. 

The tradeoff between fast and slow estimator roots can also be made using 
results from optimal estimation theory. First, let's consider that there is arandom 
input affecting the plant, called process noise, w, that enters Eq. (2.1) as 


X = Fx + Gu - G,w, (2.46) 
and a random sensor noise, v entering Eq. (2.1) as 
y = Hx +v. (2.47) 
The estimator error equation with these additional inputs is 
X = (F - LH)& + G,w — Lv. (2.48) 


In Eq. (2.48) the sensor noise is multiplied by L and the process noise is not. 
If L is very small, then the effect of sensor noise is removed but the estimator's 
dynamic response will be “slow”, so the error will not reject effects of w very 
well. The state of a low-gain estimator will not track uncertain plant inputs very 
well or plants with modeling errors. On the other hand, if L is "large", then 
the estimator response will be fast and the disturbance or process noise will be 
rejected, but the sensor noise, multiplied by L, results in large errors. Clearly, a 
balance between these two effects is required. 

It turns out that the optimal solution to this balance can be found as a 
function of the process noise intensity, R,, and the sensor noise intensity, Rs, 
both of which are scalars for the SISO case under consideration. Since the only 
quantity affecting the result is the ratio R/R, it makes sense to let R, = 1 
and vary R, only. An important advantage of using the optimal solution is that 
only one parameter, R „ needs to be varied by the designer rather than picking n 
estimator poles for an n'"-order system. The solution is calculated by MATLAB as 


L = kalman(sys, Rw,Rv). 


48 Chapter 2 Review of Continuous Control 


2.6.3 


2.6.4 


Figure 2.19 
Estimator and controller 
mechanization 


Compensation: Combined Control 
and Estimation 


We now put all this together, ignoring for the time being the effect of acommand 
input, r. If we take the control law (Eq. 2.35), combine it with the estimator 
(Eq. 2.40), and implement the control law using the estimated state elements, the 
design is complete and the equations describing the result are 


&—(F—GK — LH)& + Ly, 
u = —K&. (2.49) 


These equations describe what we previously called compensation; that is, the 
control, 4, is calculated given the measured output, y. Figure 2.19 shows schemat- 
ically how the pieces fit together. The roots of this new closed-loop system can 
be shown to consist of the chosen roots of the controller plus the chosen roots 
of the estimator that have been designed in separate procedures in Sections 2.6.1 
and 2.6.2. The poles and zeros of the compensator alone could be obtained by 
examining the system described by Eq. (2.49); however, that step need not be car- 
ried out unless the designer is curious how the compensation from this approach 
compares with compensation obtained using a transform based design method. 


Reference Input 


One obvious way to introduce a command input is to subtract y from r in exactly 
the same way it has been done for the transform design methods discussed previ- 
ously. This scheme is shown schematically in Fig. 2.20(b). Using this approach, 
a step command in r enters directly into the estimator, thus causing an estimation 
error that decays with the estimator dynamic characteristics in addition to the 
response corresponding to the control poles. 

An alternative approachconsists of entering the command r directly into the 
plant and estimator in an identical fashion as shown in Fig. 2.20(a). Since the 


Ww v 


y(t) 


Estimator 
x = Fx + Gu 
+ L(y — Hx) 


Figure 2.20 

Possible locations for 
introducing the 
reference input: 

(a) compensation in the 
feedback path, 

(b) compensation in the 
feedforward path 
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(b) 


command creates a step in u that affects the plant and estimator in an identical 
fashion, both respond identically, and no estimator error is induced. Therefore, 
there are no estimator error characteristics in the response and the total response 
consists of controller characteristics only. This approach is usually superior. 

The feedforward gain, N, can be computed so that no steady-state error 
exists. Its value is based on computing the steady-state value of the control, u,., 
and the steady-state values of the state, x.., that result in no steady-state error, e. 
The result is 


N=N,_+KN, (2.50) 
X]-[s 2] H1] 
N] .H J 1} 


In many cases, it is difficult to obtain an accurate value for the plant gain, in 
part because plants are typically nonlinear and the plant model is a linearization 
at a particular point. Therefore, the value of N will not be accurate and steady- 
state errors will result even though the model is sufficiently accurate for good 


where 


Integral Control 


50 Chapter2 Review of Continuous Control 


feedback control design. The solution is to incorporate an integral control term 
in the feedback similar to the integral control discussed in Section 2.2.3. 

Integral control is accomplished using state-space design by augmenting the 
state vector with the desired integral x,. It obeys the differential equation 


x,=Hx-r (=e). 


[i 
z= edt. 


This equation is augmented to the state equations (Eq. 2.1) and they become 


defo Fle ]+Lele-[o}: e 


The feedback law is 


Thus 


u=-[K, Ki]. 


i--K[7]. 
x 


With this revised definition of the system, the design techniques from Section 
2.6.1 can be applied in a similar fashion. The elements of K obtained are imple- 
mented as shown in Fig. 2.21. 


or simply 


2.7 Summary 


Figure 2.21 


integral control structure 


e System dynamics can be represented by a state-space description, Eq. (2.1), 
or by a transfer function, Eqs. (2.6) or (2.7). 


2.7 Summary 51 


* The key property of the Laplace transform that allows solution of differential 
equations is Eq. (2.5) 


Lif (0) = s FG). 


A system's output can be determined by the inverse Laplace transform for 
very simple cases or, more often the case, by numerical methods such as 
impulse.m, step.m, or lsim.m in MATLAB. 


If a system’s output is described by X(s) and is stable, the Final Value 
Theorem states that 


lim x(t) =x, = lim s X (s). 


too 


One can associate certain time response behavior with pole locations in the 
s-plane as summarized in Fig. 2.5. 


Control system specifications are usually defined in terms of the rise time t, 
settling time r,, and overshoot M, which are defined by Eqs. (2.16)-(2.18). 


For an open loop system given by D(s)G(s), the closed loop system as 

defined by Fig. 2.8 is given by Eq. (2.19) 
Y(s) e» D(s)G(s) 
R(s) 1-4 D(s)G(s) 


zu. 


The basic types of feedback are proportional, integral, and derivative, and 
are defined by Eqs. (2.22)- (2.24). 


The root locus is a method to sketch the location of the closed-loop roots 
of a system vs. some parameter of interest, usually the feedback gain. It is 
based on phase considerations which can easily be determined graphically 
by hand, and are therefore very useful in checking computer based results. 


The frequency response of the open-loop transfer function of a system can 
be easily analyzed to determine the stability of the related closed-loop sys- 
tem. The open-loop transfer function can be determined experimentally or 
analytically. 


Design of control systems using the state space approach is carried out by 
specifying the desired closed-loop root location, called pole-placement, or 
by selecting weighting matrices in a cost function, called optimal or LQR 
control. Either method tends to reduce the design iterations required over 
root locus or frequency response design, especially for higher order systems 
and those with multiple inputs and/or outputs. 


State space design requires that all elements of the state vector are available 
for the control; therefore, they must be measured directly or estimated using 
measurements of a portion of the state vector. Pole placement or optimal 
methods can also be used to arrive at the best estimator for this purpose. 
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2.8 Problems 


2.1 


2.2 


2.3 


2.4 


2.5 


2.6 


Design feedback with lead compensation for the open-loop system 
10 
G(s) 2 —. 
(s) E, 
The rise time should be | sec or less and the overshoot should be less than 10%. 
Design feedback with lead compensation for the open-loop system 
3 
G(s) = 5. 
( pa 
The bandwidth should be faster than | rad/sec and the phase margin should be better than 
50°. 
For the open-loop system 
2 
G(s) =, 
(s) z 


(a) design feedback assuming you have access to all the state elements. Ensure that 
there are closed-loop system poles at s = —3 + 3j. 


(b) Design an estimator for the system so that it has poles at s = —6 + 6j. 


(c) Find the transfer function of the complete controller consisting of the control from 
part (a) and the estimator from part (b). 


For the open-loop system 


“Gps s(s +4)’ 


(a) design feedback assuming you have access to all the state elements. Ensure that 
there are closed-loop system poles that provide a natural frequency of w, = 3 
rad/sec with £ = 0.5. 


(b) Design an estimator for the system so that it has poles that provide a natural 
frequency of w, = 6 rad/sec with ¢ = 0.5. 


(c) Find the transfer function of the complete controller consisting of the control from 
part (a) and the estimator from part (b). 


Can you stabilize the system 
M l 
^os +25) 


with a single lead compensation? If you can, do it. If you can’t, show why not. 


G(s) 


For the open-loop system 
l 
~ s 25) 
(a) design feedback assuming you have access to all the state elements. Place the 
closed-loop system poles ats = —1 + 1j, —0.5 x 5j. 


G(s) 


(b) Design an estimator for the system so that it has poles at s = —2 + 2j, —2 + 8j. 


(c) Find the transfer function of the complete controller consisting of the control from 
part (a) and the estimator from part (b). 


2.7 


2.8 


2.9 
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Consider a pendulum with control torque T, and disturbance torque T, whose differential 
equation is 
0-46 =T, +T}. 
Assume there is a potentiometer at the pin that measures the output angle 6, that is, 
y=. 
(a) Design a lead compensation using a root locus that provides for an M,« 1096 and 
arise time, r, « 1 sec. 


(b) Add an integral term to your controller so that there is no steady-state error in the 
presence of a constant disturbance, 7}, and modify the compensation so that the 
specifications are stil] met. 


Consider a pendulum with control torque 7. and disturbance torque T, whose differential 
equation is 

6+ 46 = T. +T; 
Assume there is a potentiometer at the pin that measures the output angle 0, that is, 
y=. 
(a) Design a lead compensation using frequency response that provides for a PM > 50? 


and a bandwidth, w,,, > 1 rad/sec. 


(b) Add an integral term to your controller so that there is no steady-state error in the 
presence of a constant disturbance, T}, and modify the compensation so that the 
specifications are still met. 


Consider a pendulum with control torque 7. and disturbance torque T, whose differential 
equation is 


Ö +40 =T, +T). 


Assume there is a potentiometer at the pin that measures the output angle 6, that is, 


[s]: 


write the system equations in state form. Give values for the matrices F, G, H. 


(a) Taking the state vector to be 


(b) Show, using state-variable methods, that the characteristic equation of the model is 
2 


(c) Write the estimator equations for 
6 
ó 
Pick estimator gains [L,, ix to place both roots of the estimator-error 
characteristic equation at s — —10. 


(d) Using state feedback of the estimated state variables 0 and 6, derive a control law 
to place the closed-loop control poles at s = —2 + 2j. 


(e) Draw a block diagram of the system, that is, estimator, plant, and control law. 


(f) Demonstrate the performance of the system by plotting the step response to a 
reference command on (i) 6, and (ii) T}. 
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2.10 


2.11 


2.12 


2.13 


2.14 


2.15 


2.16 


2.17 


(g) Design a controller with an integral term and demonstrate its performance to the 
step inputs as in (f). 


For the open-loop system 
3 
G(s) = ————. 
8) s? +25 —3 
determine 
(a) the final value to a unit step input. 
(b) Answer (a) for the case where 
3 
G(s) = ———— 
(s) gs? +2543 
For the open-loop system 
3 
G(s) = ————, 
(s) s? +2s -3 


assume there is a feedback with a proportional gain, K, and sketch a locus of the closed- 
loop roots vs. K. What is the minimum value of K to achieve a stable system? 


For the open-loop system 
l 
meua Na TET 
s^(s^ +2s + 100) 


use a single lead compensation in the feedback to achieve as fast a response as possible, 
keeping the damping of the resonant mode better than ¢ = 0.05. 


G(s) 


Sketch the locus of roots vs. the parameter b for 
s? +bs+b+1=0. 
Sketch the root locus with respect to K for the open-loop system 


K(s + 3) 
G(s) = ———_—: 
s(s + 2)(s + 1) 
After completing the hand sketch, verify your result using MATLAB. 


Sketch the root locus with respect to K for the open-loop system 
K(s + 2) 
G(s) = E 


After completing the hand sketch, verify your result using MATLAB. 


Sketch the root locus with respect to K for the open-loop system 
K(s + 2) 


After completing the hand sketch, verify your result using MATLAB. 
Sketch the root locus with respect to K for the open-loop system 
K 1 
G(s) = eee EN 
s(s + 2)(s° + 25) 
After completing the hand sketch, verify your result using MATLAB. 
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2.18 Sketch a Bode plot for the open-loop system 
0.1 
G(s) = —— es) x ) i 
s(s + 1)(s* + 2s + 100) 
After completing the hand sketch, verify your result using MATLAB. With unity feedback, 
would the system be stable? 
2.19 Sketch a Bode plot for the open-loop system 
G(s) = 100(s + 1) 
c s*(s + 10) 
After completing the hand sketch, verify your result using MATLAB. With unity feedback, 
would the system be stable? What is the PM? 
2.20 Sketch a Bode plot for the open-loop system 
Gs) == 5000(s + 1) 
s°(s + 10)(s + 50) 
After completing the hand sketch, verify your result using MATLAB. With unity feedback, 
would the system be stable? If not, how would you stabilize it? 


* * 


Introductory Digital Control 


A Perspective on Introductory Digital Control 


The continuous controllers you have studied so far are built using analog elec- 
tronics such as resistors, capacitors, and operational amplifiers. However, most 
control systems today use digital computers (usually microprocessors or 
microcontrollers) with the necessary input/output hardware to implement the 
controllers. The intent of this chapter is to show the very basic ideas of design- 
ing control laws that will be implemented in a digital computer. Unlike analog 
electronics, digital computers cannot integrate. Therefore, in order to solve a dif- 
ferential equation in a computer, the equation must be approximated by reducing 
it to an algebraic equation involving sums and products only. These approxi- 
mation techniques are often referred to as numerical integration. This chapter 
shows a simple way to make these approximations as an introduction to digital 
control. Later chapters expand on various improvements to these approximations, 
show how to analyze them, and show that digital compensation may also be car- 
ried out directly without resorting to these approximations. In the final analysis, 
we will see that direct digital design provides the designer with the most accurate 
method and the most flexibility in selection of the sample rate. 

From the material in this chapter, you should be able to design and imple- 
ment a digital control system. The system would be expected to give adequate 
performance if the sample rate is at least 30 times faster than the bandwidth of 
the system. 


Chapter Overview 


In Section 3.1, you will learn how to approximate a continuous D(s) with a set 
of difference equations, a design method sometimes referred to as emulation. 
Section 3.1 is sufficient to enable you to approximate a continuous feedback 
controller in a digital control system. Section 3.2 shows the basic effect of 
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3.1 


Figure 3.1 

Basic control-system 
block diagrams: 

(a) continuous system, 
(b) with a digital 
computer 


sampling on the performance of the system and a simple way to analyze that 
effect. Section 3.3 shows how to convert a continuous PID control law to the 
digital form. 


Digitization 


Figure 3.1(a) shows the topology of the typical continuous system. The com- 
putation of the error signal, e, and the dynamic compensation, D(s), can all be 
accomplished in a digital computer as shown in Fig. 3.1(b). The fundamental 
differences between the two implementations are that the digital system operates 
on samples of the sensed plant output rather than on the continuous signal and 
that the dynamics represented by D(s) are implemented by algebraic recursive 
equations called difference equations. 

We consider first the action of the analog-to-digital (A/D) converter on a 
signal. This device acts on a physical variable, most commonly an electrical 
voltage, and converts it into a binary number that usually consists of 10 or 12 
bits. A binary number with 10 bits can take on 2'° = 1024 values; therefore, 
an A/D converter with 10 bits has a resolution of 0.1%. The conversion from 
the analog signal y(t) occurs repetitively at instants of time that are T seconds 


r(t) y(t) 


Sensor 


sample period 
sample rate 


ZOH 


Euler's method 


difference equations 
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apart. T is called the sample period and 1/ T is the sample rate in cycles per 
second or Hz (also sometimes given in radians/second or 2x:/ T). The sampled 
signal is y(kT) where k can take on any integer value. It is often written simply 
as y(k). We call this type of variable a discrete signal to distinguish it from a 
continuous variable like y(t), which changes continuously in time. We make the 
assumption here that the sample period is fixed; however, it may vary depending 
on the implementation as discussed in Section 1.1. 

There also may be a sampler and A/D converter for the input command, r (1), 
producing the discrete r (kT) from which the sensed y (k T) would be subtracted 
to arrive at the discrete error signal, e(kT). The differential equation of the 
continuous compensation is approximated by a difference equation which is the 
discrete approximation to the differential equation and can be made to duplicate 
the dynamic behavior of a D(s) if the sample period is short enough. The result 
of the difference equation is a discrete u(kT) at each sample instant. This signal 
is converted to a continuous u(t) by the D/A and hold. The D/A converts the 
binary number to an analog voltage, and a zero-order hold (ZOH) maintains that 
same voltage throughout the sample period. The resulting u(t) is then applied to 
the actuator in precisely the same manner as the continuous implementation. 

One particularly simple way to make a digital computer approximate the real 
time solution of differential equations is to use Euler’s method. It follows from 
the definition of a derivative that 

acm ôx ! 
Bm (3.1) 
where ôx is the change in x over a time interval dt. Even if ôt is not quite equal 
to zero, this relationship will be approximately true, and 
x(k) = et uS (3.2) 
iN 
where 


T=1,,, — t, (the sample interval in seconds), 
t, = kT (for a constant sample interval), 

k is an integer, 

x(k) is the value of x at t,, and 

x(k + 1) is the value of x at t,,,. 


This approximation’ can be used in place of all derivatives that appear in the 
controller differential equations to arrive at a set of equations that can be solved 
by a digital computer. These equations are called difference equations and are 
solved repetitively with time steps of length T. For systems having bandwidths 


| This particular version is called the forward rectangular rule. See Problem 3.2 for the backward 
rectangular version. 
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of a few Hertz, sample rates are often on the order of 100 Hz, so that sample 
periods are on the order of 10 msec and errors from the approximation can be 
quite small. 


€ Example 3.1 


Difference Equations Using Eulers Method 


Using Euler's method, find the difference equations to be programmedinto the control computer 
in Fig. 3.1(b) for the case where the D(s) in Fig. 3.1(a) is 


DOES =% (3.3) 


Solution. Firstfindthe differential equation thatcorresponds to D(s). After cross multiplying 
Eq. (3.3) to obtain 


(s + b)U(s) = K,(s + a) E(s), 
we can see by inspection that the corresponding differential equation is 
ú + bu = K (ė + ae). (3.4) 


Using Euler’s method to approximate Eq. (3.4) according to Eq. (3.2), we get the approximating 
difference equation 


1) — u(k = 
u(k +1) — u(k) FURER elk + 1) — e(k) + ae] (3.5) 
T 2 T 
Rearranging Eq. (3.5) puts the difference equation in the desired form 
u(k +1) 2 u(k) - T E "uit (“oa + set) | : (3.6) 


Equation (3.6) shows how to compute the new value of the control, u(k + 1), given the past 
value of the control, u(k), and the new and past values of the error signal, e(k + 1) and e(k). 
For computational efficiency, it is convenient to re-arrange Eq. (3.6) to 


u(k +1) = ( — bT)u(k) + K (aT — De(k) + Kelk + 1). (3.7) 


* 


In principle, the difference equation is evaluated initially with k = 0, then 
k = 1,2, 3,.... However, there is usually no requirement that values for all 
times be saved in memory. Therefore, the computer need only have variables 
defined for the current and past values for this first-order difference equation. 
The instructions to the computer to implement the feedback loop in Fig. 3.1(b) 
with the difference equation from Eq. (3.7) would call for a continual looping 
through the code in Table 3.1. Note in the table that the calculations have been 
arranged so as to minimize the computations required between the reading of 
the A/D and the writing to the D/A, thus keeping the computation delay to a 
minimum. 


Table 3.1 
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Real Time Controller implementation 


€ Example 3.2 


x = 0 (initialization of past values for first loop through) 
Define constants: 

a, =1-—bT 

a,=K ACI — 1) 

READ A/D to obtain y andr 

e=r-y 

u=x+K,e 

OUTPUT u to DIA and ZOH 

now compute x for the next loop through 

x =a,Uta,e 

go back to READ when T seconds have elapsed since last READ 


The sample rate required depends on the closed-loop bandwidth of the sys- 
tem. Generally, sample rates should be faster than 30 times the bandwidth in order 
to assure that the digital controller can be made to closely match the performance 
of the continuous controller. Discrete design methods described in later chapters 
will show how to achieve this performance and the consequences of sampling 
even slower if that is required for the computer being used. However, when using 
the techniques presented in this chapter, a good match to the continuous con- 
troller is obtained when the sample rate is greater than approximately 30 times 
the bandwidth. 


Lead Compensation Using a Digital Computer 


Find digital controllers to implement the lead compensation 


s+2 
D(s) = 70— — 3.8 
BT EMO pa 
for the plant 
G -= 
(s) s(s + 1) 


using sample rates of 20 Hz and 40 Hz. Implement the control equations on an experimental 
laboratory facility like that depicted in Fig. 3.1, that is, one that includes a microprocessor for 
the control equations, a ZOH, and analog electronics for the plant. Compute the theoretical 
step response of the continuous system and compare that with the experimentally determined 
step response of the digitally controlled system. 


Solution. Comparing the compensation transfer function in Eq. (3.8) with Eq. (3.3) shows 
that the values of the parameters in Eq. (3.6) are a = 2, b = 10, and K, = 70. For a sample 
rate of 20 Hz, T = 0.05 sec and Eq. (3.6) can be simplified to 


u(k + 1) = 0.5u(k) + 70[e(k + 1) — O.9e(&)]. 
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For a sample rate of 40 Hz, T = 0.025 sec and Eq. (3.6) simplifies to 
u(k + 1) = 0.75u(k) + 70[e(k + 1) — 0.95e(k)]. 
The statements in MATLAB to compute the continuous step response is 
numD = 70*[1 2] denD=[1 10] 
numG=1, denG=[1 1 90] 
sys1 = tf(numD,denD)*tf(numG,denG) 
sysCL = feedback(sys1, 1) 
step(sysCL). 


Figure 3.2 shows the step response of the two digital controllers compared to the continuous 
step response. Note that the 40 Hz sample rate (about 30 x bandwidth) behaves essentially like 
the continuous case, whereas the 20 Hz sample rate (about 15 x bandwidth) has a detectable 
increased overshoot signifying some degradation in the damping. The damping would degrade 
further if the sample rate were made any slower. 

The MATLAB file that created Fig. 3.2 (fig32.m) computed the digital responses as well 
as the continuous response. You will learn how to compute the response of a digital system in 
Chapter 4. 


* 


Figure 3.2 
Continuous and digital step response using Euler's method for discretization: (a) 20 Hz 
sample rate, (b) 40 Hz sample rate 
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Figure 3.3 
The delay due to the 
hold operation 
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In Chapter 6, you will see that there are several ways to approximate a 
continuous transfer function, each with different merits, and most with better 
qualities than the Euler method presented here. In fact, MATLAB provides a 
function (c2d.m) that computes these approximations. However, before those 
methods can be examined, it will be necessary to understand discrete transfer 
functions, a topic covered in Chapter 4. 


Effect of Sampling 


It is worthy to note that the single most important impact of implementing a 
control system digitally is the delay associated with the hold. A delay in any 
feedback system degrades the stability and damping of the system. Because each 
value of u(kT) in Fig. 3.1(b) is held constant until the next value is available 
from the computer, the continuous value of u(t) consists of steps (see Fig. 3.3) 
that, on the average, lag u(kT) by T/2, as shown by the dashed line in the figure. 
By incorporating a continuous approximation of this 7/2 delay in a continuous 
analysis of the system, an assessment can be made of the effect of the delay in 
the digitally controlled system. The delay can be approximated by the method of 
Padé. The simplest first-order approximation is 
2/T 
sT2/T. 

Figure 3.4 compares the responses from Fig. 3.2 with a continuous analysis 
that includes a delay approximation according to Eq. (3.9). 

This linear approximation of the sampling delay (Eq. (3.9)) could also be 
used to determine the effect of a particular sample rate on the roots of a system 
via linear analysis, perhaps a locus of roots vs. T. Alternatively, the effect of a 
delay can be analyzed using frequency response techniques because a time delay 
of T/2 translates into a phase decrease of 


G,(s) = (3.9) 


wT 
ô$ = aor (3.10) 
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Figure 3.4 

Continuous and digital 
step response at 20 Hz 
sample rate showing 
results with a 7/2 delay 
approximation 
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Time (sec) 


Thus, we see that the loss of phase margin due to sampling can be estimated 
by invoking Eq. (3.10) with w equal to the frequency where the magnitude equals 
one, that is, the “gain crossover frequency.” 


€ Example 3.3 


Approximate Analysis of the Effect of Sampling 


For the system in Example 3.2, determine the decrease in damping that would result from 
sampling at 10 Hz. Use both linear analysis and the frequency response method. Compare 
the time response of the continuous system with the discrete implementation to validate the 
analysis. 


Solution. The damping of the system in Example 3.2 can be obtained from the MATLAB 
statement 


damp(sysCL) 


where SysCL is that computed in Example 3.2. The result is ¢ = .56. 
The damping of the system with the simple delay approximation added (Eq. (3.9)) is 
obtained from 


T= 1/10 

numDL = 2/7, denDL={1  2/T] 
Sys2 = tf(numDL,denDL)*sys1 
SysCL = feedback(sys2, 1) 
damp(sysCL) 


where Sys1 is that computed in Example 3.2. The result of this calculation is ¢ = .33. 

The frequency response of the continuous system is shown by the solid line in Fig. 3.5 and 
shows that the crossover frequency is about 6 rad/sec and the PM is about 50^. The line of small 
circles shows the phase corrected by Eq. (3.10) and, therefore, that the PM decreases to about 
30°. For more precision, the use of margin.m in MATLAB shows that the continuous system 
has a PM of 49.5" at a crossover frequency of 6.17 rad/sec. Equation (3.10) then indicates that 
the correction due to sampling should be 17.7°, thus the PM of the digital system would be 


3.2 Effect of Sampling 65 


31.8°. Since the PM is approximately 100 x Z, this analysis shows that the ¢ decreases from 
approximately 0.5 for the continuous system to 0.32 for the digital system. 

Both analysis methods indicate a similar reduction in the damping of the system. One 
should, therefore, expect that the overshoot of the step response should increase. For the case 
with no zeros, Fig. 2.7 indicates that this decrease in Z should result in the step response 
overshoot, M,, going from 16% to 35% for a 2nd-order system with no zeros. The actual step 
responses in Fig. 3.6 have about 20% overshoot for the continuous system and about 30% for 
the digital case. So, we see that the approximate analysis was somewhat conservative in the 
prediction of the decreased damping and increased overshoot in the digital case. The trend 
that decreasing sample rate causes decreasing damping and stability will be analyzed more 
completely throughout the book. 


Figure 3.5 
Frequency response for 
Example 3.3 


Figure 3.6 
Continuous and digital 
responses for Example 
3.3 (at 10 Hz sample 
rate) 
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3.3 


PID Control 


The notion of proportional, integral, and derivative (PID) control is reviewed in 
Section 2.2.3. Reviewing again briefly, the three terms are proportional control 


u(t) = Ke(t), (3.11) 
integral control 
u(t) = = Í e(n)dn, (3.12) 
T, Jo 
and derivative control 
u(t) = KT, e(t), (3.13) 


where K is called the proportional gain, 7, the integral time, and T, the derivative 
time. These three constants define the control. 

The approximations of these individual control terms to an algebraic equation 
that can be implemented in a digital computer are proportional control 


u(k) = Ke(k), (3.14) 
integral control 
u(k) = u(k—1) + pre, (3.15) 
and derivative control 
u(k) = Ate [e(k) — e(k — 1)]. (3.16) 


Equation (3.11) is already algebraic, therefore Eq. (3.14) follows directly while 
Eqs. (3.15) and (3.16) result from an application of Euler's method (Eq. (3.2)) to 
Eqs. (3.12) and (3.13). However, normally these terms are used together and, in 
this case, the combination needs to be done carefully. The combined continuous 
transfer function (Eq. 2.24) is 

u(s) 


1 


Therefore, the differential equation relating u(t) and e(t) is 
1 
ú = K (é+ —e + T ë) 
T, 


and the use of Euler’s method (twice for ë) results in 


a Ft ee = LATET NR CN 
u(k) — u(k n+a[(1+ 742) et (14272) e 1) + Petk »| 


(3.17) 


€ Example 3.4 
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Transforming a Continuous PID to a Digital Computer 


A micro-servo motor has a transfer function from the input applied voltage to the output speed 
(rad/sec), 


360000 
G(9)— F606 600)" 


It has been determined that PID control with K = 5, T,, = 0.0008 sec, and T,= 0.003 sec gives 
satisfactory performance for the continuous case. Pick an appropriate sample rate, determine 
the corresponding digital control law, and implement on a digital system. Compare the digital 
step response with the calculated response of a continuous system. Also, separately investigate 
the effect of a higher sample rate and re-tuning the PID parameters on the ability of the digital 
system to match the continuous response. 


(3.18) 


Solution. The sample rate needs to be selected first. But before we can do that, we need to 
know how fast the system is or what its bandwidth is. The solid line in Fig. 3.7 shows the step 
response of the continuous system and indicates that the rise time is about 1 msec. Based on 
Eq. (2.16), this suggests that c», = 1800 rad/sec, and so the bandwidth would be on the order 
of 2000 rad/sec or 320 Hz. Therefore, the sample rate would be about 3.2 kHz if 10 times 
bandwidth. So let's pick T — 0.3 msec. Use of Eq. (3.17) results in the difference equation 


u(k) = u(k — 1) + S[3.7667e(k) — 6.3333e(k — 1) + 2.6667e(k — 2)] 


which, when implemented in the digital computer results in the line with stars in Fig. 3.7. 
This implementation shows a considerably increased overshoot over the continuous case. The 
line with circles in the figure shows the improved performance obtained by increasing the 
sample rate to 10 kHz; i.e., a sample rate about 30 times bandwidth, while using the same PID 
parameters as before. It shows that the digital performance has improved to be essentially the 
same as the continuous case. 

Increasing the sample rate, however, will increase the cost of the compuler and the 
A/D converter; therefore, there will be a cost benefit by improving the performance while 
maintaining the 3.2 kHz sample rate. A look at Fig. 3.7 shows that the digital response 
(T = 0.3 msec) has a faster rise time and less damping than the continuous case. This suggests 
that the proportional gain, K, should be reduced to slow the system down and the deriva- 
tive time, T,, should be increased to increase the damping. Some trial and error, keeping 
these ideas in mind, produces the results in Fig. 3.8. The revised PID parameters that pro- 
duced these results are K = 3.2 and T, — 0.0011 sec. The integral reset time, T,, was left 
unchanged. 


+ 


This example once again showed the characteristics of a digital control 
system. The damping was degraded an increasing amount as the sample rate 
was reduced. Furthermore, it was possible to restore the damping with suitable 
adjustments to the control. 
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Figure 3.7 
Step response of a 


micro-motor, Example 


3.4, same PID 
parameters 


Figure 3.8 


Effect of PID tuning on 
the digital response, 
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e Digitization methods allow the designer to convert a continuous compensa- 
tion, D(s), into a set of difference equations that can be programmed directly 


into a control computer. 
e Euler’s method can be used for the digitization 
x(k + 1). — x(k) 


x(k) = 7 


(3.2) 


* As long as the sample rate is on the order of 30 x bandwidth or faster, the 
digitally controlled system will behave close to its continuous counterpart 
and the continuous analysis that has been the subject of your continuous 


control systems study will suffice. 


e For sample rates on the order of 10 to 30 times the bandwidth, a first order 
analysis can be carried out by introducing a delay of 7/2 in the continuous 
analysis to see how well the digital implementation matches the continuous 


analysis. A zero-pole approximation for this delay is 
2/T 


C, 5) = $42/T' 


(3.9) 
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The delay can be analyzed more accurately using frequency response where 
the phase from the continuous analysis should be decreased by 


bf = oF (3.10) 
2 
e A continuous PID control law whose transfer function is 
Dye K(1 e cabe 
e(s) T,s E 


can be implemented digitally using Eq. (3.17) 


u(k) =u(k -—1) + K (1 + z + z) e(k) — (: +212) e(k—1)+ To eck -2) : 
Ẹpo T T T 
The digital control system will behave reasonably close to the continuous 
system providing the sample rate is faster than 30 times the bandwidth. 

* In order to analyze the system accurately for any sample rate, but especially 
for sample rates below about 30 times bandwidth, you will have to proceed 
on to the next chapters to learn about z-transforms and how to apply them to 
the study of discrete systems. 

* For digital control systems with sample rates less than 30 times bandwidth, 
design is often carried out directly in the discrete domain, eliminating ap- 
proximation errors. 


3.5 Problems 


3.4 Do the following: 


(a) Design a continuous lead compensation for the satellite attitude control example 
(G(s) = l/s?) described in Appendix A.l so that the complex roots are at 
approximately s = —4.4+ j4.4 rad/sec. 


(b) Assuming the compensation is to be implemented digitally, approximate the effect 
of the digital implementation to be a delay of 7/2 as given by 


2/T 
$+2/T 


and determine the revised root locations for sample rates of w, = 5 Hz, 10 Hz, and 
20 Hz where T = 1/c, sec. 


3.2 Repeat Example 3.1 (from page 60), but use the approximation that 


dor ea (Bp PEEL) 

x(k) = T ; 
the backward rectangular version of Euler's method. Compare the resulting 
difference equations with the forward rectangular Euler method. Also compute the 
numerical value of the coefficients for both cases vs. sample rate for w, = 1 - 100 
Hz. Assume the continuous values from Eq. (3.8) The three coefficients of 
interest for both the forward and backward rectangular cases are the 
coefficients of u(k), e(k), and e(k+ 1). 


G, (s) = 
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3.3 


3.4 


3.5 


3.6 


3.7 


3.8 


3.9 


For the compensation 
541 
s+15’ 


use Euler’s forward retangular method to determine the difference equations for a digital 
implementation with a sample rate of 80 Hz. Repeat the calculations using the backward 
retangular method (see Problem 3.2) and compare the difference equation coefficients. 


D(s) 2 25 


For the compensation 

st2 

s+20° 

use Euler’s forward retangular method to determine the difference equations for a digital 


implementation with a sample rate of 80 Hz. Repeat the calculations using the backward 
retangular method (see Problem 3.2) and compare the difference equation coefficients. 


D(s) =5 


The read arm on a computer disk drive has the transfer function 
1000 


Design a digital PID controller that has a bandwidth of 100 Hz, a phase margin of 50°, 
and has no output error for a constant bias torque from the drive motor. Use a sample 
rate of 6 kHz. 


The read arm on a computer disk drive has the transfer function 


G(s) 2 —-. 
Design a digital controller that has a bandwidth of 100 Hz and a phase margin of 50°. 
Use a sample rate of 6 kHz. 


F Or 
G ( ) a 
Sha 3 


(a) design a continuous compensation so that the closed-loop system has a rise time 
t, < | sec and overshoot M, « 15% to a step input command, 

(b) revise the compensation so the specifications would still be met if the feedback was 
implemented digitally with a sample rate of 5 Hz, and 


(c) find difference equations that will implement the compensation in the digital 
computer. 


The read arm on a computer disk drive has the transfer function 

G(s) = 500. 

ZA 
Design a continuous lead compensation so that the closed-loop system has a bandwidth 
of 100 Hz and a phase margin of 50°. Modify the MATLAB file fig32.m so that you can 
evaluate the digital version of your lead compensation using Euler’s forward 
rectangular method. Try different sample rates, and find the slowest one where the 
overshoot does not exceed 30%. 
The antenna tracker has the transfer function 
10 

s(s +2) 


G(s) = 


3.10 
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Design a continuous lead compensation so that the closed-loop system has a rise time 
t, < 0.3 sec and overshoot M, < 10%. Modify the MATLAB file fig32.m so that you can 
evaluate the digital version of your lead compensation using Euler's forward rectangular 
method. Try different sample rates, and find the slowest one where the overshoot does 
not exceed 20%. 
The antenna tracker has the transfer function 

10 
s(s--2) 
Design a continuous lead compensation so that the closed-loop system has a rise time 


I, « 0.3 sec and overshoot M, « 10%. Approximate the effect of a digital implementa- 
tion to be 


G(s) = 


and estimate M, for a digital implementation with a sample rate of 10 Hz. 


+ + 


Discrete Systems Analysis 


4.1 


A Perspective on Discrete Systems Analysis 


The unique element in the structure of Fig. 3.1 is the digital computer. The 
fundamental character of the digital computer is that it takes a finite time to 
compute answers, and it does so at discrete steps in time. The purpose of this 
chapter is to develop tools of analysis necessary to understand and to guide the 
design of programs for a computer sampling at discrete times and acting as a 
linear, dynamic control component. Needless to say, digital computers can do 
many things other than control linear dynamic systems; it is our purpose in this 
chapter to examine their characteristics when doing this elementary control task 
and to develop the basic analysis tools needed to write programs for real-time 
computer control. 


Chapter Overview 


Section 4.1 restates the difference equations used by a computer to represent a 
dynamic system, a topic covered very briefly in Section 3.1. The tool foranalyzing 
this sort of system, the z-transform, is introduced and developed in Section 4.2. 
Use of the z-transform is developed further in Section 4.3 to show how it applies 
to the combined system in Fig. 3.1. Furthermore, state-space models of discrete 
systems are developed in this section. Section 4.4 shows the correspondence 
between roots in the z-plane and time response characteristics while Section 4.5 
discusses characteristics of the discrete frequency response. The last section, 4.6, 
derives properties of the z-transform. 


Linear Difference Equations 


We assume that the analog-to-digital converter (A/D) in Fig. 1.1 takes samples 
of the signal y at discrete times and passes them to the computer so that y(k T) = 


73 
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constant coefficients 


y(KT). The job of the computer is to take these sample values and compute in 
some fashion the signals to be put out through the digital-to-analog converter 
(D/A). The characteristics of the A/D and D/A converters will be discussed later. 
Here we consider the treatment of the data inside the computer. Suppose we call 
the input signals up to the kth sample ej, e,, €,, .... e,, and the output signals 
prior to that time uo, 4,, U3, ..., 4, ,. Then, to get the next output, we have the 
machine compute some function, which we can express in symbolic form as 


Wig os singe May MS (4.1) 


Because we plan to emphasize the elementary and the dynamic possibilities, 
we assume that the function f in Eq. (4.1) is linear and depends on only a finite 
number of past e’s and u’s. Thus we write 


U, = A, Ugi 78, 75000 7 0,44, 
Tes FD ups Digg: 
Equation (4.2) is called a linear recurrence equation or difference 
equation and, as we shall see, has many similarities with a linear differential 
equation. The name “difference equation" derives from the fact that we could 
write Eq. (4.2) using u, plus the differences in u,, which are defined as 


(4.2) 


Vu, =U, — Uy) 

2 -— _ 
Vu, = Vu, — Vu,_, 

n — r-l = -l 
Vu, = Vu, EUR 


(4.3) 


If we solve Eq. (4.3) for the values of u,, u, ,, and u, , in terms of differ- 
ences, we find 


-]* 


U, = U,» 

u,_, =u, — Vu, 
EU 2 

U, =U, 2Vu, +V u,- 


Thus, for a second-order equation with coefficients a,,a,, and b, (we let 
b, — b, — 0 for simplicity), we find the equivalent difference equation to be 


a, V u, — (a, + 2a,)Vu, + (a, +a, + 1)u, = bye,- 


Although the two forms are equivalent, the recurrence form of Eq. (4.2) 
is more convenient for computer implementation; we will drop the form using 
differences. We will continue, however, to refer to our equations as “difference 
equations.” If the a’s and b’s in Eq. (4.2) are constant, then the computer is solving 
a constant-coefficient difference equation (CCDE). We plan to demonstrate 
later that with such equations the computer can control linear constant dynamic 
systems and approximate most of the other tasks of linear, constant, dynamic 
systems, including performing the functions of electronic filters. To do so, it is 
necessary first to examine methods of obtaining solutions to Eq. (4.2) and to 
study the general properties of these solutions. 


Figure 4.1 


The Fibonacci numbers 
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To solve a specific CCDE is an elementary matter. We need a starting time 
(k-value) and some initial conditions to characterize the contents of the computer 
memory at this time. For example, suppose we take the case 


Uy = A, VEM, (4.4) 


and start at k = 2. Here there are no input values, and to compute u, we need 
to know the (initial) values for u, and u,. Let us take them to be ug — u, =l. 
The first nine values are 1, 1, 2, 3, 5, 8, 13, 21, 34.... A plot of the values of u, 
versus k is shown in Fig. 4.1. 

The results, the Fibonacci numbers, are named after the thirteenth-century 
mathematician! who studied them. For example, Eq. (4.4) has been used to 
model the growth of rabbits in a protected environment?. However that may be, 
the output of the system represented by Eq. (4.4) would seem to be growing, to 
say the least. If the response of a dynamic system to any finite initial conditions 
can grow without bound, we call the system unstable. We would like to be able 
to examine equations like Eq. (4.2) and, without having to solve them explicitly, 
see if they are stable or unstable and even understand the general shape of the 
solution. 


u(k) 


| Leonardo Fibonacci of Pisa, who introduced Arabic notation to the Latin world about 1200 A.D. 

2 Wilde (1964). Assume that u, represents pairs of rabbits and that babies are born in pairs. Assume that 
no rabbits die and that a new pair begin reproduction after one period. Thus at time &, we have all the 
old rabbits, «, ,, plus the newborn pairs born to the mature rabbits, which are u,_,. 
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characteristic equation 


One approach to solving this problem is to assume a form for the solution 
with unknown constants and to solve for the constants to match the given initial 
conditions. For continuous, ordinary, differential equations that are constant and 
linear, exponential solutions of the form e" are used. In the case of linear, constant, 
difference equations, it turns out that solutions of the form zt will do where z has 
the role of s and k is the discrete independent variable replacing time, 7. Consider 
Eq. (4.4). If we assume that u(k) = Az‘, we get the equation 


Az = Az} + Azk?, 


Now if we assume z $0 and A = 0, we can divide by A and multiply by z~“, 
with the result 


1=7 +z 
or 
z-—z-l. 


This polynomial of second degree has two solutions, z, , = 1/2 + A/5/2. Let's 
call these z, and z,. Since our equation is linear, a sum of the individual solutions 
will also be a solution. Thus, we have found that a solution to Eq. (4.4) is of the 
form 


u(k) = A,z," + Ag. 


We can solve for the unknown constants by requiring that this general solution 
satisfy the specific initial conditions given. If we substitute k = 0 and k = |, we 
obtain the simultaneous equations 


1=A,+A,, 
l = Az + Az, 
These equations are easily solved to give 
4. 
1 2/5 , 
i es 
UC eS 


And now we have the complete solution of Eq. (4.4) in a closed form. 
Furthermore, we can see that since z, = (1 + J5)/ 2 is greater than 1, the term 
in ra will grow without bound as k grows, which confirms our suspicion that 
the equation represents an unstable system. We can generalize this result. The 
equation in z that we obtain after we substitute u = z* is a polynomial in z 
known as the characteristic equation of the difference equation. If any solution 
of this equation is outside the unit circle (has a magnitude greater than one), 
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the corresponding difference equation is unstable in the specific sense that for 
some finite initial conditions the solution will grow without bound as time goes 
to infinity. If a// the roots of the characteristic equation are inside the unit circle, 
the corresponding difference equation is stable. 


€ Example 4.1 


Discrete Stability 
Is the equation 
u(k) = 0.9u(k — 1) — 0.2u(k — 2) 


stable? 


Solution. The characteristic equation is 
z’ — 0.92 + 0.2 = 0, 


and the characteristic roots are z = 0.5 and z = 0.4. Since both these roots are inside the unit 
circle, the equation is stable. 


Figure 4.2 

Plot of a function and 
alternative 
approximations to the 
area under the curve 
over a single time 
interval 


+ 


As an example of the origins of a difference equation with an external 
input, we consider the discrete approximation to integration. Suppose we have a 
continuous signal, e(t), of which a segment is sketched in Fig. 4.2, and we wish 
to compute an approximation to the integral 


Hey e(t) dt, (4.5) 
0 


using only the discrete values e(0), ..., e(t, ,), e(t,). We assume that we have 
an approximation for the integral from zero to the time 7, , and we call it u, ,. 
The problem is to obtain u, from this information. Taking the view of the integral 
as the area under the curve e(t), we see that this problem reduces to finding an 
approximation to the area under the curve between /, , and /,. Three alternatives 
are sketched in Fig. 4.2. We can use the rectangle of height e, ,, or the rectangle 
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Trapezoid rule 


4.2 


of height e,, or the trapezoid formed by connecting e,_, to e, by a straight line. 
If we take the third choice, the area of the trapezoid is 


i—1 
A ate, Kera. (4.6) 


Finally, if we assume that the sampling period, 7, — t,_,, is a constant, T, we are 
led to a simple formula for discrete integration called the trapezoid rule 


T 
u, = V + 3 d (4.7) 


If e(t) — t, then e, = kT and substitution of u, = (T?/2)k? satisfies Eq. (4.7) 
and is exactly the integral of e. (It should be, because if e(r) is a straight line, 
the trapezoid is the exact area.) If we approximate the area under the curve by 
the rectangle of height e, ,, the result is called the forward rectangular rule 
(sometimes called Euler’s method, as discussed in Chapter 3 for an approximation 
to differentiation) and is described by 


ú, =U . Te, ,- 
The other possibility is the backward rectangular rule, given by 
u,=u,_,+Te,. 


Each of these integration rules is a special case of our general difference equation 
Eq. (4.2). We will examine the properties of these rules later, in Chapter 6, while 
discussing means to obtain a difference equation that will be equivalent to a given 
differential equation. 

Thus we see that difference equations can be evaluated directly by a digital 
computer and that they can represent models of physical processes and approxi- 
mations to integration. It turns out that if the difference equations are linear with 
coefficients that are constant, we can describe the relation between u and e by a 
transfer function, and thereby gain a great aid to analysis and also to the design 
of linear, constant, discrete controls. 


The Discrete Transfer Function 


We will obtain the transfer function of linear, constant, discrete systems by the 
method of z-transform analysis. A logical alternative viewpoint that requires a 
bit more mathematics but has some appeal is given in Section 4.6.2. The results 
are the same. We also show how these same results can be expressed in the state 
space form in Section 4.2.3. 


4.2.1 
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The z-Transform 


If a signal has discrete values e}, €}, ..., €. ... we define the z -transform of the 
signal as the function ?* 


E(z) 


n 


Z{e(k)} 
B ues el = Ri, (4.8) 


k--—oo 
and we assume we can find values of r, and R, as bounds on the magnitude of 
the complex variable z for which the series Eq. (4.8) converges. A discussion of 
convergence is deferred until Section 4.6. 


Ip 


€ Example 4.2 


The z-Transform 


The data e, are taken as samples from the time signal e7% 1(r) at sampling period T where I(r) 
is the unit step function, zero for £ < 0, and one for ¢ > 0. Then e, = e **7 | (KT). Find the 
z-transform of this signal. 


Solution. Applying Eq. (4.8), we find that 


oo oo 
Y get= ee 


k=—00 0 


T Yee 
0 


* 


We will return to the analysis of signals and development of a table of useful 
z-transforms in Section 4.4; we first examine the use of the transform to reduce 


3 We use the notation = to mean “is defined as.” 


4 In Eq. (4.8) the lower limit is —oo so that values of e, on both sides of k = O are included. The 
transform so defined is sometimes called the two-sided z-transform to distinguish it from the one-sided 
definition, which would be Doo ener. For signals that are zero for k < 0, the transforms obviously 
give identical results. To take the one-sided transform of u, ,, however, we must handle the value of 
t_,, and thus are initial conditions introduced by the one-sided transform. Examination of this property 
and other fcatures of the one-sided transform are invited by the problems. We select the two-sided 
transform because we need to consider signals that extend into negative time when we study random 
signals in Chapter 12. 
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4.2.2 


difference equations to algebraic equations and techniques for representing these 
as block diagrams. 


The Transfer Function 


The z-transform has the same role in discrete systems that the Laplace transform 
has in analysis of continuous systems. For example, the z-transforms for e, and 
u, in the difference equation (4.2) or in the trapezoid integration (4.7) are related 
in a simple way that permits the rapid solution of linear, constant, difference 
equations of this kind. To find the relation, we proceed by direct substitution. 
We take the definition given by Eq. (4.8) and, in the same way, we define the 
z-transform of the sequence {u,} as 
oo 


U(z) = X IR E (4.9) 


=—00 


Now we multiply Eq. (4.7) by z™ and sum over k. We get 


co oco T oo oo 
25 mat = b» al t 3 (E e,z ™ + p aaz) s (4.10) 


=- k=-% [—-—oo k=- 


From Eq. (4.9), we recognize the left-hand side as U(z). In the first term on the 
right, we let k — 1 = j to obtain 


Ye a Ay eRe (4.11) 


k=-00 j=- 


By similar operations on the third and fourth terms we can reduce Eq. (4.10) to 


U(z) = 27'U(z) + ZEO + z` E(2)). (4.12) 


Equation (4.12) is now simply an algebraic equation in z and the functions U and 
E. Solving it we obtain 
Iud 
U(z)= >= 
(z) 21 


= E(2). (4.13) 
AZ 
We define the ratio of the transform of the output to the transform of the input 
as the transfer function, H(z). Thus, in this case, the transfer function for 
trapezoid-rule integration is 

U(z) ~ Pot 

E(z) 2z-1 
For the more general relation given by Eq. (4.2), it is readily verified by the same 
techniques that 


(4.14) 


byt bz +- +b,” 


H(z) = ——————————, 
(z) l +a z +a”? Haz” 


zeros 
poles 
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and if n > m, we can write this as a ratio of polynomials in z as 


5 + Beate" + Ni + ip 


A(z) = —————————— 4.15 
(z) z" taz" d^ Eras T ( ) 
or 
b(z) 
H(z) = : 
(z) A 


This transfer function is represented in MATLAB in the tf form similarly to the 
continuous case as discussed after Eq. (2.6). The numerator of Eq. (4.15) would 
be specified in MATLAB as a1 x (n + 1) matrix of the coefficients, for example, 
whenm —n 


num —[b, b, b, ... b,] 


and when n > m, there would be n — m zeros after b, . The quantity specifying 
the denominator would be specified as a 1 x (m + 1) matrix, for example 


den = U xa, d; - 1l. 


Note that H(z) was assumed to be in the form given by Eq. (4.15), that is, 
with positive powers of z. The discrete system is specified as 


sys — tf(num, den, T) 


where T is the sample period. 
The general input—output relation between transforms with linear, constant, 
difference equations is 


U(z) = A(z) E(z). (4.16) 


Although we have developed the transfer function with the z-transforrn, it is also 
true that the transfer function is the ratio of the output to the input when both 
vary as z*. 

Because H(z) is a rational function of a complex variable, we use the ter- 
minology of that subject. Suppose we call the numerator polynomial b(z) and 
the denominator a(z). The places in z where b(z) = 0 are zeros of the transfer 
function, and the places in z where a(z) = O are the poles of H(z). If z, is a pole 
and (z — z,)’ H(z) has neither pole nor zero at z}, we say that H (z) has a pole of 
order p at z,. If p = 1, the pole is simple. The transfer function Eq. (4.14) has 
a simple pole at z = 1 and a simple zero at z = —1. When completely factored, 
the transfer function would be 


IT. (2 — 2j) 


H(z2)—K ; 
(z) Tena 


(4.17) 
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z^! and cycle delay 


Figure 4.3 
The unit delay 


4.2.3 


and the quantities specifying the transfer function in the MATLAB zpk form are 
an m x | matrix of the zeros, an n x 1 matrix of the poles, and a scalar gain, for 
example 


zy P 
a= 5s " p= P; s k=K 
Zm P, 


The system is then 


sys = zpk(z, p, k, T). 


We can now give a physical meaning to the variable z. Suppose we let 
all coefficients in Eq. (4.15) be zero except b, and we take b, to be 1. Then 
H(z) =z". But H(z) represents the transform of Eq. (4.2), and with these 
coefficient values the difference equation reduces to 


u, = ei (4.18) 


The present value of the output, u,, equals the input delayed by one period. Thus 
we see that a transfer function of z^! is a delay of one time unit. We can picture 
the situation as in Fig. 4.3, where both time and transform relations are shown. 

Since the relations of Eqs. (4.7), (4.14), (4.15) are all composed of delays, 
they can be expressed in terms of z~'. Consider Eq. (4.7). In Fig. 4.4 we illustrate 
the difference equation (4.7) using the transfer function z~' as the symbol for a 
unit delay. 

We can follow the operations of the discrete integrator by tracing the signals 
through Fig. 4.4. For example, the present value of e, is passed to the first summer, 
where it is added to the previous value e, ,, and the sum is multiplied by 7/2 to 
compute the area of the trapezoid between e, , and e,. This is the signal marked 
a, in Fig. 4.4. After this, there is another sum, where the previous output, 4, ,, 
is added to the new area to form the next value of the integral estimate, u,. The 
discrete integration occurs in the loop with one delay, z^, and unity gain. 


Block Diagrams and State-Variable Descriptions 


Because Eq. (4.16) is a linear algebraic relationship, a system of such relations 
is described by a system of linear equations. These can be solved by the methods 
of linear algebra or by the graphical methods of block diagrams in the same 


Ce 7 Uk] My = ex) 
E(z) U(z) = z7 VE(z) 
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Figure 4.4 
A block diagram of 


trapezoid integration as e, ae l, a, 
represented by Eq. (4.7) ~ n PEG) m Ps 


way as for continuous system transfer functions. To use block-diagram analysis 
to manipulate these discrete-transfer-function relationships, there are only four 
primitive cases: 


1. The transfer function of paths in parallel is the sum of the single-path transfer 
functions (Fig. 4.5). 

2. The transfer function of paths in series is the product of the path transfer 
functions (Fig. 4.6). 

3. The transfer function of a single loop of paths is the transfer function of the 
forward path divided by one minus the loop transfer function (Fig. 4.7). 

4. The transfer function of an arbitrary multipath diagram is given by combi- 
nations of these cases. Mason’s rule® can also be used. 


For the general difference equation of (4.2), we already have the transfer 
function in Eq. (4.15). It is interesting to connect this case with a block diagram 


Figure 4.5 
Block diagram of parallel 
blocks 


H(z) = H (2) + H2(2) 


Figure 4.6 


Block diagram of 
cascade blocks 
H(z) = H (H5) 


6 Mason (1956). See Franklin, Powell, and Emami-Naeini (2019) for a discussion. 
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Figure 4.7 
Feedback transfer 
function 


control canonical form 


HQ) 
1 —Hi (z) H4(z) 


H(z) = 


using only simple delay forms for z in order to see several "canonical" block 
diagrams and to introduce the description of discrete systems using equations of 
state. 


*Canonical Forms 


There are many ways to reduce the difference equation (4.2) to a block diagram 
involving z only as the delay operator, z~'. The first one we will consider leads 
to the “control” canonical form. We begin with the transfer function as a ratio 
of polynomials 


b 
U(z) = H(z)E(z) = A E) = b(z)£, 
where 
-EO 
v a(z) 
and thus 
alj = E(z). 


At this point we need to get specific; and rather than carry through with a 
system of arbitrary order, we will work out the details for the third-order case. 
In the development that follows, we will consider the variables u, e, and & 
as time variables and z as an advance operator such that zu(k) = u(k + 1) or 
z u(k) = u(k — 1). With this convention (which is simply using the property of 
z derived earlier), consider the equations 


(A+ a,z +a,z+a,)& =e, (4.19) 
(by + 6,27 + b,z be = u. (4.20) 
We can write Eq. (4.19) as 
zE =e—azé —a,zé — a$, 
E(k + 3) = e(k) — a €(k +2) — a(k + 1) — a,§ (k). (4.21) 
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Now assume we have z?£, which is to say that we have £(k + 3) because z? 
is an advance operator of three steps. If we operate on this with z^! three times 
in a row, we will get back to (k), as shown in Fig. 4.8(a). From Eq. (4.21), we 
can now compute z?£ from e and the lower powers of z and £ given in the block 
diagram; the picture is now as given in Fig. 4.8(b). To complete the representation 
of Eqs. (4.19) and (4.20), we need only add the formation of the output u as a 
weighted sum of the variables 2°&, z?£, z£, and & according to Eq. (4.20). The 
completed picture is shown in Fig. 4.8(c). 

In Fig. 4.8(c), the internal variables have been named x,, x,, and x,. These 
variables comprise the state of this dynamic system in this form. Having the 
block diagram shown in Fig. 4.8(c), we can write down, almost by inspection, the 
difference equations that describe the evolution of the state, again using the fact 
that the transfer function z^' corresponds to a one-unit delay. For example, we 
see that x,(k + 1) = x,(k) and x,(k + 1) = x, (k). Finally, expressing the sum at 
the far left of the figure, we have 


x(k +1) = —a,x,(k) — ajx,(k) — ax, (k) + e(k). 
We collect these three equations together in proper order, and we have 
x (k + 1) = —ayx, (k) — ajx,(k) — ax, (k) + e(k), (4.22) 
x(k + 1) = x (k), 
x(k + 1) = x,(k). 
Using vector-matrix notation, ’ we can write this in the compact form 


x(k + 1) = A x(k) + B e(k), 


where 
X 
X = X, 
X 
(4.23) 
—a, =a; —4 
A. = l 0 0 
0 1 0 
and 
l 
B, = Q4 (4.24) 


The output equation is also immediate except that we must watch to catch all 
paths by which the state variables combine in the output. The problem is caused 


7 We assume the reader has some knowledge of matrices. The results we require and references to study 
material are given in Appendix C. 
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Ek + 3) B Ek + 2) et Uk +1) ie] gk) 


(a) 


Figure 4.8 

Block diagram 
development of control 
canonical form: 

(a) Solving for &(k); 

(b) solving for ¿(k + 3) 
from e(k) and past £'s; 
(c) solving for U(k) from 
E'S 


observer canonical form 
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by the b, term. If b, = 0, then u = b,x, + b,x, + b,x,, and the corresponding 
matrix form is immediate. However, if b, is not 0, x, for example not only reaches 
the output through b, but also by the parallel path with gain —b,a,. The complete 
equation is 


u = (b, — a Bx + (b, — a,bo)x, + (b; — umbo) + boe. 
In vector/matrix notation, we have 
u = Cx+De 
where 


C, = [b,-ab, b,—a,b, b,- ab, | (4.25) 


0 


D. = b 


c 0* 


(4.26) 


We can combine the equations for the state evolution and the output to give the 
very useful and most compact equations for the dynamic system, 


x(k--1) = A,x(k) 4 B,e(&), 


where A, and B, for this control canonical form are given by Eq. (4.23), and C, 
and D, are given by Eq. (4.25). 

The other canonical form we want to illustrate is called the “observer” 
canonical form and is found by starting with the difference equations in opera- 
tor/transform form as 


3 2 he 2 
Zulu u+a,zu + a,u = bz ect bz e + b,ze + be. 


In this equation, the external input is e(k), and the response is u(k), which is 
the solution of this equation. The terms with factors of z are time-shifted toward 
the future with respect to k and must be eliminated in some way. To do this, we 
assume at the start that we have the u(k), and of course the e(k), and we rewrite 
the equation as 


S223 2 A. d n. 3 D 
b,e — a.u — zu t az'u t azu — byz e — b,z'e — b,ze. 


Here, every term on the right is multiplied by at least one power of z, and thus 
we can operate on the lot by z~' as shown in the partial block diagram drawn in 
Fig. 4.9(a). 

Now in this internal result there appear a,u and —b,e, which can be canceled 
by adding proper multiples of u and e, as shown in Fig. 4.9(b), and once they 
have been removed, the remainder can again be operated on by z~". 

If we continue this process of subtracting out the terms at k and operating on 
the rest by z~', we finally arrive at the place where all that is left is u alone! But 
that is just what we assumed we had in the first place, so connecting this term 
back to the start finishes the block diagram, which is drawn in Fig. 4.9(c). 
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Figure 4.9 
Block diagram «n 
development of observer 

canonical form: (a) the 

first partial sum and 

delay; (b) the second 

partial sum and delay; 

(c) the completion with 

the solution for u(k) 


z?u *a,zuta,u -boze — byze -b,e 


e(k) 


zu taju — bgze — bje 
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A preferred choice of numbering for the state components is also shown in 
the figure. Following the technique used for the control form, we find that the 
matrix equations are given by 


x(k + 1) = A,x(k) + B,e(X) 


u(k) = C,x(k) + D,e(k). (4.27) 
where 
=á 1 0 
A, = —a, 0 1 
—a, 0 0 
b, — b,a, 
B, = nc 
3 — 943 
C, = [1 0 0 | 
D, = [b]. 


The block diagrams of Figs. 4.8 and 4.9 are called direct canonical form 
realizations of the transfer function H(z) because the gains of the realizations 
are coefficients in the transfer-function polynomials. 

Another useful form is obtained if we realize a transfer function by placing 
several first- or second-order direct forms in series with each other, a cascade 

cascade canonical form canonical form. In this case, the H(z) is represented as a product of factors, 
and the poles and zeros of the transfer function are clearly represented in the 
coefficients. 

For example, suppose we have a transfer function 


= 2 0.52750.252-- 025 
~ z*-2.62 + 2.42? 0.8 
— (z* G^ — 0.52 + 0.25) 

(z? — zz — 1.62 +0.8) | 

The zero factor z + 1 can be associated with the pole factor z? — z to form 
one second-order system, and the zero factor z? —0.5z + 0.25 can be associated 
with the second-order pole factor z? ~ 1.6z + 0.8 to form another. The cascade 


factors, which could be realized in a direct form such as control or observer form, 
make a cascade form as shown in Fig. 4.10. 


Figure 4.10 | e(k) 2%- 0,524 0.25 
Block diagram of a "m 162 + 0.81 
cascade realization 3 j 


H(z) 


u(k) 
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Table 4.1 


4.2.4 Relation of Transfer Function to 


Pulse Response 


We have shown that a transfer function of z~' is a unit delay in the time domain. 
We can also give a time-domain meaning to an arbitrary transfer function. Recall 
that the z-transform is defined by Eq. (4.8) to be E(z) — hea, and the transfer 
function is defined from Eq. (4.16) as H(z) when the input and output are related 
by U(z) = H(z)E(z). Now suppose we deliberately select e(k) to be the unit 
discrete pulse defined by 


= L, (k =0), 
^7 0, (k #0), 
2. (4.28) 


Then it follows that E(z) — 1 and therefore that 
U(z) = H(z). (4.29) 


Thus the transfer function H(z) is seen to be the transform of the response to 
a unit-pulse input. For example, let us look at the system of Fig. 4.4 and put a 
unit pulse in at the e,-node (with no signals in the system beforehand). * We can 
readily follow the pulse through the block and build Table 4.1. 

Thus the unit-pulse response is zero for negative k, is T/2 at k = 0, and 
equals T thereafter. The z-transform of this sequence is 


oo oo 


H(z) = Die Sat 


Step-by-step construction of the unit pulse response for Fig. 4.4 


k e, e, 8, Uki Eh 
o o0 1 T2 0 T/2 

1 1 (0) T/2 T/2 T 

2 0 0 0 T T, 

3 0 0 0 T T 


8 In this development we assume that Eq. (4.7) is intended to be used as a formula for computing values 
of u, as k increases. There is no reason why we could not also solve for u, as k takes on negative 
values. The direction of time comes from the application and not from the recurrence equation. 
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If we add 7/2 to the z°-term and subtract 7/2 from the whole series, we have a 
simpler sum, as follows 


xe T 
Hm Tee 
k=0 2 
T py 
Myr (1 < IzD 
2T —T(1—$ 5 
20 =e") 
T +727! 
2(1— z^!) 
Tz+1 
= (à < |zl). (4.30) 
2z—1] 


Of course, this is the transfer function we obtained in Eq. (4.13) from direct 
analysis of the difference equation. 

A final point of view useful in the interpretation of the discrete transfer 
function is obtained by multiplying the infinite polynomials of E(z) and H(z) 
as suggested in Eq. (4.16). For purposes of illustration, we will assume that the 
unit-pulse response, /;,, is zero for k < 0. Likewise, we will take k = 0 to be 
the starting time for e,. Then the product that produces U(z) is the polynomial 
product given in Fig. 4.11. 


Figure 4.11 
Representation of the product E(z)H(z) as a product of polynomials 


eo tei! +e22~? +e32~3 4e 
ho +hız7! +h2z73 +hg2-3 eave 
eoho4-e1hoz^! +eghoz~? +eghoz~3 
+eoh1z7! +eihız7? +eoh,z73 
+eoh227? +eih2273 
+eph3z—3 


eghot+(eohi + eiho)z 1 4-(egha Teha + egho)z ? --(egha + e3ha + ezhi + egho)z ?4- EL 
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convolution 


Since this product has been shown to be U(z) = Xuz*, it must therefore 
follow that the coefficient of z~* in the product is u,. Listing these coefficients, 
we have the relations 

Uy = egt 

u, =e,h, +e,hy 

u, =e yh, teh, +e,h, 

u, = Ohh; Jet Heh, eh, 


The extrapolation of this simple pattern gives the result 


k 
u, = ) eh, j 
j=0 


By extension, we let the lower limit of the sum be —oo and the upper limit be 
+00: 


oo 


u, = y eh, (4.31) 


j=- 


Negative values of j in the sum correspond to inputs applied before time equals 
zero. Values for j greater than k occur if the unit-pulse response is nonzero for 
negative arguments. By definition, such a system, which responds before the 
input that causes it occurs, is called noncausal. This is the discrete convolution 
sum and is the analog of the convolution integral that relates input and impulse 
response to output in linear, constant, continuous systems. 

To verify Eq. (4.31) we can take the z-transform of both sides 


co oo oo 
EK. =k 
) Hz = ) z ) eh, j- 


-—00 =-00 j=- 


Interchanging the sum on j with the sum on k leads to 


-k 
= p» x 2n z hgj: 
j=- =—00 
Now let k — j = l in the second sum 
U(z) = D e, s hac un, 
j=-% l=- 
but z (*? = z^'z^/, which leads to 


U(z) = y EL Y hye 


j=- [2-09 


and we recognize these two separate sums as 


U(z) = E(z) H(z). 


4.2.5 


4.2 The Discrete Transfer Function 93 


We can also derive the convolution sum from the properties of linearity and 
stationarity. First we need more formal definitions of “linear” and “stationary.” 


1. Linearity: A system with input e and output u is linear if superposition 
applies, which is to say, if u,(k) is the response to e,(k) and u,(k) is the 
response to e,(k), then the system is linear if and only if, for every scalar o 
and £, the response to we, + Be, is eu, + Bu,. 

2. Stationarity: A system is stationary, or time invariant, if a time shift in the 
input results in only a time shift in the output. For example, if we take the 
system at rest (no internal energy in the system) and apply a certain signal 
e(k), suppose we observe a response u(k). If we repeat this experiment at 
any later time when the system is again at rest and we apply the shifted input, 
e(k — N), if we see u(k — N), then the system is stationary. A constant 
coefficient difference equation is stationary and typically referred to as a 
constant system. 


These properties can be used to derive the convolution in Eq. (4.31) as 
follows. If response to a unit pulse at k = O is h(k), then response to a pulse 
of intensity e, is e,h(k) if the system is linear. Furthermore, if the system is 
stationary then a delay of the input will delay the response. Thus, if 


Finally, by linearity again, the total response at time k to a sequence of these 
pulses is the sum of the responses, namely, 
ú acte PIENE, e F du E BID udo a TEM. 


or 


k 
u, = ) ena, 
1-0 


Now note that if the input sequence began in the distant past, we must include 
terms for / « 0, perhaps back to / — —oo. Similarly, if the system should be 
noncausal, future values of e where / > k may also come in. The general case is 
thus (again) 


d, sud. (4.32) 


I-—oo 


External Stability 


A very important qualitative property of a dynamic system is stability, and we 
can consider internal or external stability. Internal stability is concerned with 
the responses at all the internal variables such as those that appear at the delay 
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elements in a canonical block diagram as in Fig. 4.8 or Fig. 4.9 (the state). 
Otherwise we can be satisfied to consider only the external stability as given by 
the study of the input-output relation described for the linear stationary case by 
the convolution Eq. (4.32). These differ in that some internal modes might not be 
connected to both the input and the output of a given system. 

For external stability, the most common definition of appropriate response 
is that for every Bounded Input, we should have a Bounded Output. If this is true 
we say the system is BIBO stable. A test for BIBO stability can be given directly 
in terms of the unit-pulse response, /,. First we consider a sufficient condition. 
Suppose the input e, is bounded, that is, there is an M such that 


lel < M < œ for all. (4.33) 


If we consider the magnitude of the response given by Eq. (4.32), it is easy to see 


that 
lu < Ie, , 


which is surely less than the sum of the magnitudes as given by 


< » 'lellh, ,l. 


But, because we assume Eq. (4.33), this result is in turn bounded by 


x Mh, yl. (4.34) 
Thus the output will be bounded for every bounded input if 
$ Thy] « ee: (4.35) 
1——o0 


This condition is also necessary, for if we consider the bounded (by 1!) input 


=} 
e === (h70) 
FB 
—0 (h_, = 0) 
and apply it to Eq. (4.32), the output at k = O is 
Ho = >. eh 
l=-% 


Il 
Me 
me 
[S 


(4.36) 


Il 
Me 
= 


4.2 The Discrete Transfer Function 95 


Thus, unless the condition given by Eq. (4.35) is true, the system is not BIBO 
stable. 


€ Example 4.3 


Integration Stability 


Is the discrete approximation to integration (Eq. 4.7) BIBO stable? 


Solution. The test given by Eq. (4.35) can be applied to the unit pulse response used to 
compute the u, -column in Table 4.1. The result is 
h= T/2 
h, =T, k>0 
oo 
b I| = T/2 + p T — unbounded. (4.37) 
k=l 
Therefore, this discrete approximation to integration is not BIBO stable! 


* 
+ Example 4.4 General Difference Equation Stability 
Consider the difference equation (4.2) with all coefficients except a, and b, equal to zero 
U, — au, 4, + bee,. (4.38) 
Is this equation stable? 
Solution. The unit-pulse response is easily developed from the first few terms to be 
ug = bo; u= aby» u, = a?b,, 
u, =h, =b, k20. (4.39) 


Applying the test, we have 


oo oo k ] 
2 Ih = 2 bola = aaa (lal < 1) 
= unbounded (la| > 1). 


Thus we conclude that the system described by this equation is BIBO stable if |a| < 1, and 
unstable otherwise. 
* 


For a more general rational transfer function with many simple poles, we can 
expand the function in partial fractions about its poles, and the corresponding 
pulse response will be a sum of respective terms. As we saw earlier, if a pole 
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4.3 


4.3.1 


Figure 4.12 
The prototype 
sampled-data system 


is inside the unit circle, the corresponding pulse response decays with time 
geometrically and is stable. Thus, if all poles are inside the unit circle, the system 
with rational transfer function is stable; if at least one pole is on or outside the 
unit circle, the corresponding system is not BIBO stable. With modern computer 
programs available, finding the poles of a particular transfer function is no big 
deal. Sometimes, however, we wish to test for stability of an entire class of 
systems; or, as in an adaptive control system, the potential poles are constantly 
changing and we wish to have a quick test for stability in terms of the literal 
polynomial coefficients. In the continuous case, such a test was provided by 
Routh; in the discrete case, the most convenient such test was worked out by Jury 
and Blanchard(1961).” 


Discrete Models of Sampled-Data Systems 


The systems and signals we have studied thus far have been defined in discrete 
time only. Most of the dynamic systems to be controlled, however, are continuous 
systems and, if linear, are described by continuous transfer functions in the 
Laplace variable s. The interface between the continuous and discrete domains 
are the A/D and the D/A converters as shown in Fig. 1.1. In this section we 
develop the analysis needed to compute the discrete transfer function between 
the samples that come from the digital computer to the D/A converter and the 
samples that are picked up by the A/D converter.'? The situation is drawn in 
Fig. 4.12. 


Using the z-Transform 


We wish to find the discrete transfer function from the input samples u(kT) 
(which probably come from a computer of some kind) to the output samples 
y(kT) picked up by the A/D converter. Although it is possibly confusing at 
first, we follow convention and call the discrete transfer function G(z) when 
the continuous transfer function is G(s). Although G(z) and G(s) are entirely 
different functions, they do describe the same plant, and the use of s for the 
continuous transform and z for the discrete transform is always maintained. To 


u(k t YKT) 


9 See Franklin, Powell, and Workman, 2nd edition, 1990, for a discussion of the Jury test. 


10 In Chapter 5, a comprehensive frequency analysis of sampled data systems is presented. Here we 
undertake only the special problem of finding the sample-to-sample discrete transfer function of a 
continuous system between a D/A and an A/D. 


ZOH 


Figure 4.13 
D/A output for 
unit-pulse input 
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find G(z) we need only observe that the y(k7) are samples of the plant output 
when the input is from the D/A converter. As for the D/A converter, we assume 
that this device, commonly called a zero-order hold or ZOH, accepts a sample 
u(kT) at t = kT and holds its output constant at this value until the next sample 
is sent at t = kT + T. The piecewise constant output of the D/A is the signal, 
u(t), that is applied to the plant. 

Our problem is now really quite simple because we have just seen that the 
discrete transfer function is the z-transform of the samples of the output when the 
input samples are the unit pulse atk = 0. If u(kT) = 1 fork =O and u(kT) =0 
for k = 0, the output of the D/A converter is a pulse of width T seconds and height 
1, as sketched in Fig. 4.13. Mathematically, this pulse is given by 1(t) — 1 (t — T). 
Let us call the particular output in response to the pulse shown in Fig. 4.13 y (t). 
This response is the difference between the step response [to I(t) ] and the 
delayed step response [to 1(¢ — 7T)]. The Laplace transform of the step response 
is G(s)/s. Thus in the transform domain the unit pulse response of the plant is 


Ys tej a (4.40) 


and the required transfer function is the pudet. of the samples of the inverse 
of Hn (s), which can be expressed as 


G(z) = 2{¥,(kT)} 
zi" UO OSZ Y (s) 


zama 29 


This is the sum of two parts. The first part is Z( o). and the second is 


ater) ati 


because e~"* is exactly a delay of one period. Thus the transfer function is 


GR) =(1-z7)Z | N, (4.41) 
S 


T 2T 3T 4T 
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€ Example 4.5 


Discrete Transfer Function of 1st-Order System 


What is the discrete transfer function of 
G(s) — a/(s +a) 
preceded by a ZOH? 


Solution. We will apply the formula (4.41) 
G(s) — a 1 l 


s s(sca) s sta’ 


and the corresponding time function is 


ie ie —1()-e*1(rn). 


The samples of this signal are 1(kT) — e^^*7 | (T), and the z-transform of these samples is 


[I -—— 
s z=b Zeer 


La Lle 
NG =e") 
We could have gone to the tables in Appendix B and found this result directly as Entry 2. 


Now we can compute the desired transform as 


z-1 z(l—e) 


HRS 
z (z—-DG-e*") 
ler 
= M (4.42) 
* 
+ Example 4.6 Discrete Transfer Function of a 1/s Plant 
What is the discrete transfer function of 
| 
G(s) = 3 


s? 


preceded by a ZOH? 


Solution. We have 

1 

GQ) -(0-27)2 [5] 

s 
This time we refer to the tables in Appendix B and find that the z-transform associated with 
1/5? is 

T’ (zt) 
2 (2-1) 
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and therefore Eq. (4.41) shows that 


JT) 


G(z) 26 - DU 


(4.43) 


-+ 


The MATLAB function, c2d.m computes Eq. (4.41) (the ZOH method is the 
default) as well as other discrete equivalents discussed in Chapter 6. It is able to 
accept the system in any of the forms. 


€ Example 4.7 


Discrete Transfer Function of a 1/s” Plant Using MATLAB 


Use MATLAB to find the discrete transfer function of 


I 
G(s) = = 
(s) 2 
preceded by a ZOH, assuming the sample period is T = | sec. 
Solution. The MATLAB script 
=<): 
s=tf('s'); 
sysC = 1/s^2; 
sysD = c2d(sysC,T,'zoh') 
produces the result that 
0z? + 0.5z + 0.5 
fs A io ek 
z2—2z2+1 (=A) 


which is the same as Eq. (4.43) with 7 = 1. 


4.3.2 


*Continuous Time Delay 


We now consider computing the discrete transfer function of a continuous system 
preceded by a ZOH with pure time delay. The responses of many chemical 
process-control plants exhibit pure time delay because there is a finite time of 
transport of fluids or materials between the process and the controls and/or the 
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sensors. Also, we must often consider finite computation time in the digital 
controller, and this is exactly the same as if the process had a pure time delay. 
With the techniques we have developed here, it is possible to obtain the discrete 
transfer function of such processes exactly, as Example 4.8 illustrates. 


€ Example 4.8 


Discrete Transfer Function of 1st-Order System with Delay 


Find the discrete transfer function of the mixer in Appendix A.3 with a= 1, T = 1, and 
w= "15: 


Solution. The fluid mixer problem in Appendix A.3 is described by 
G(s) 2e H(s). 


The term e^? represents the delay of À seconds, which includes both the process delay and the 


computation delay, if any. We assume that H (s) is a rational transfer function. To prepare this 
function for computation of the z-transform, we first define an integer @ and a positive number 
m less than 1.0 such that A = €T — mT. With these definitions we can write 


G(s) | e e"75 H(s) 
ora s j 


Because £ is an integer, this term reduces to z~* when we take the z-transform. Because m < 1, 
the transform of the otherterm is quite direct. We select H (s) = a/(s + a) and, after the partial 
fraction expansion of H (s)/s, we have 


z-1 ems ents 
oe) = tar 2 |= - 55): 
To complete the transfer function, we need the z-transforms of the inverses of the terms in 
the braces. The first term is a unit step shifted left by m T seconds, and the second term is an 
exponential shifted left by the same amount. Because m « 1, these shifts are less than one full 
period, and no sample is picked up in negative time. The signals are sketched in Fig. 4.14. 
The samples are given by 1(k7) and e?" ** |(& T). The corresponding z-transforms 
are z/(z — 1) and ze^ "^" /(z — e^?" ). Consequently the final transfer function is 


G(z) = ls ==} 


z zliz—-l1 z-—e-T 


z—lfz-—e-*T-—(z—1)-2"T 


zt (z— 1)(z — e-*7) 
= (1-eomr) SF 0 
z'(z—e-9T) 
where the zero position is at —w = —(e^?"7 — e~°7)/(1 — e^7"7). Notice that this zero is 
near the origin of the z-plane when m is near | and moves outside the unit circle to near —oo 
when m approaches 0. For specific values of the mixer, we take a = 1, T = 1, and A = 1.5. 


Then we can compute that £ = 2 and m = 0.5. For these values, we get 


0.3935(z + 0.6025) (4.44) 


G(z) = 3G 0.3679) 


Figure 4.14 

Sketch of the shifted 
signals showing sample 
points 
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-27 -T O T 2T 3T 4T ST 6T 


-2T -T O T 2T 3T 4T ST 6T 


In MATLAB, the transfer function for this system would be computed by 
(d= 1.5; wats; L1 
sysC = tf(a[1 a],'OutputDelay', Td) 
SysD = c2d(sysC,T,zoh) 


4.3.3 


State-Space Form 


Computing the z-transform using the Laplace transform as in Eq. (4.41) is a 
very tedious business that is unnecessary with the availability of computers. We 
will next develop a formula using state descriptions that moves the tedium to 
the computer. A continuous, linear, constant-coefficient system of differential 
equations was expressed in Eq. (2.1) as a set of first-order matrix differential 
equations. For a scalar input, it becomes 


x = Fx+Gu+G,u, (4.45) 


where u is the scalar contro] input to the system and w is a scalar disturbance 
input. The output was expressed in Eq. (2.2) as a linear combination of the state, 
x, and the input, u, which becomes for scalar output 


y=Hx+ Ju. (4.46) 
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Often the sampled-data system being described is the plant of a control problem, 
and the parameter J in Eq. (4.46) is zero and will frequently be omitted. 


€ Example 4.9 


State Representation of a 1/s? Plant 


Apply Eqs. (4.45) and (4.46) to the double integrator plant of the satellite control problem in 
Appendix A.1 


l 
G(s) 2 =. 
()7 3 
Solution. The satellite attitude-control example is shown in block diagram form in Fig. 4.15 


and the attitude (0) and attitude rate (8) are defined to be x, and X3, respectively. Therefore, 
the equations of motion can be written as 


ae lau ea 


F ——6 (4.47) 
0-2y 


pi eT [e] 


which, in this case, turns out to be a rather involved way of writing 


=u. 


Figure 4.15 

Satellite attitude control 
in classical 
representation 


* 


The representations given by Eqs. (4.45) and (4.46) are not unique. Given 
one state representation, any nonsingular linear transformation of that state such 
as Bx — Tx is also an allowable alternative realization of the same system. 

If we let £ — Tx in Eqs. (4.45) and (4.46), we find 


é = Tx = T(Fx + Gu + Gw) 
= TFx + TGu + TG,w, 
ë= TFT” + TGu + TG,w, 
Y=HT-!E+ Ju. 
If we designate the system matrices for the new state € as A, B, C, and D, then 


€=AE+Bu+Bw, y= CE Du, 


u ô 8 
: f 
X2 *1 
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where 


A-TET", B=TG, B,-TG, €-HT',D-Il 


1 


€ Example 4.10 


State Transformation for 1/s? Plant 


Find the state representation for the case with the state definitions of the previous example 
interchanged. 


Solution. Let£, = x, and & = x, in Eq. (4.47); or, in matrix notation, the transformation to 
interchange the states is 


In this case T^! — T, and application of the transformation equations to the system 
matrices of Eq. (4.47) gives 


SIE: 5] »- |; | ee T4 
+ 


Most often, a change of state is made to bring the description matrices into a 
useful canonical form. We saw earlier how a single high-order difference equation 
could be represented by a state description in control or in observer canonical 
form. Also, there is a very useful state description corresponding to the partial- 
fraction expansion of a transfer function. State transformations can take a general 
description for either a continuous or a discrete system and, subject to some 
technical restrictions, convert it into a description in one or the other of these 
forms, as needed. 

We wish to use the state description to establish a general method for obtain- 
ing the difference equations that represent the behavior of the continuous plant. 
Fig. 4.16 again depicts the portion of our system under consideration. Ultimately, 
the digital controller will take the samples y(k), operate on that sequence by 
means of a difference equation, and put out a sequence of numbers, u(k), which 
are the inputs to the plant. The loop will, therefore, be closed. To analyze the 
result, we must be able to relate the samples of the output y(k) to the samples of 
the control u(k). To do this, we must solve Eq. (4.45). 

We will solve the general equation in two steps. We begin by solving the equa- 
tion with only initial conditions and no external input. This is the homogeneous 
equation 


X, —Fx() x, (4) =X: (4.48) 
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Figure 4.16 

System definition with 
sampling operations 
shown 


(kT) (kT) 


To solve this, we assume the solution is sufficiently smooth that a series expansion 
of the solution is possible 


X,(t) =A, F A(t 1) c A,(t H. (4.49) 


If we let t = tj, we find immediately that A, = x,. If we differentiate Eq. (4.49) 
and substitute into Eq. (4.48), we have 


A, +2A,(t — 1) + 3A,(t — t) +--+ = Fx, 


and, at 1 = to» A, = Fx,. Now we continue to differentiate the series and the 
differential equation and equate them at t, to arrive at the series 


F? T ^ 2 F? a 3 
x(t) = [I+ RU = 1) 4 dae 


This series is defined as the matrix exponential and written 


x, (t) = eF'7x(r.), (4.50) 
where, by definition, the matrix exponential is 
(t —1,)° (= i)? 
F=) — 2 0 3 0 NM 
gum EIER =E — +F =" + 
xæ 1 k 
See E (4.51) 
k=O : 


It can be shown that the solution given by Eq. (4.50) is unique, which leads 
to very interesting properties of the matrix exponential. For example, consider 
two values of t : t, and tj. We have 


x(t) = eF^7t?)x(t) 
and 
X(t) = ex (t,). 


Because f, is arbitrary also, we can express x(t,) as if the equation solution began 
at t, for which 


x(t) = Fa x(t). 
Substituting for x(t,) gives 


X(t) = ehh oF, ~ x(t). 
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We now have two separate expressions for x(t,), and, if the solution is unique, 
these must be the same. Hence we conclude that 


eF) — efh) gF(t-t) (4.52) 


for all t, t, tọ Note especially that if t, = t, then 


] e Fo gF nto. 


Thus we can obtain the inverse of eF by merely changing the sign of t! We will 
use this result in computing the particular solution to Eq. (4.45). 

The particular solution when 4 is not zero is obtained by using the method 
of variation of parameters.'' We guess the solution to be in the form 


x,(t) e"? v(r), (4.53) 


where v(t) is a vector of variable parameters to be determined [as contrasted to the 
constant parameters x(f,) in Eq. (4.50)]. Substituting Eq. (4.53) into Eq. (4.45), 
we obtain 


Fel oy + gFt-t9y = Feo + Gu. 
and, using the fact that the inverse is found by changing the sign of the exponent, 
we can solve for ù as 
w(t) = a 9Gu(t). 


Assuming that the control u(r) is zero for t < t,, we can integrate ù from t, to t 
to obtain 


t 
v(t) = e Fe Gu(t)dt. 
fo 


Hence, from Eq. (4.53), we get 


t 
x) = aut e FW Gu(r)dr, 
t, 


0 


and simplifying, using the results of Eq. (4.52), we obtain the particular solution 
(convolution) 


x,(t) = i e"? Gu(r)dr. (4.54) 


0 


The total solution for w = 0 and u zz 0 is the sum of Eqs. (4.50) and (4.54): 


Gert OK) +f e 7? Gu(t)dt. (4.55) 
t, 


0 


11 Due to Joseph Louis Lagrange, French mathematician (1736-1813). We assume w = 0, but because 
the equations are linear, the effect of w can be added later. 
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We wish to use this solution over one sample period to obtain a difference 
equation: hence we juggle the notation a bit (let £ = KT + T and t, = kT) and 
arrive at a particular version of Eq. (4.55): 


kT+T 
x(kT + T) = ef x(kT) + ji eXTYT-9 Gy (r)dr. (4.56) 
kT 
This result is not dependent on the type of hold because u is specified in 
terms of its continuous time history, u(t), over the sample interval. A common 
and typically valid assumption is that of a zero-order hold (ZOH) with no delay, 
that is, 


u(t) — u(kT), kT xt «KT 4 T. 


If some other hold is implemented or if there is a delay between the application 
of the control from the ZOH and the sample point, this fact can be accounted for 
in the evaluation of the integral in Eq. (4.56). The equations for a delayed ZOH 
will be given in the next subsection. To facilitate the solution of Eq. (4.56) for a 
ZOH with no delay, we change variables in the integral from r to » such that 


n=kT+T—-t. 
Then we have 
T 
X(kT + T) = e'"x(kT) +f e™dnGu(kT). (4.57) 
0 
If we define 
d$ = e" 
T 
r= l e™"dnG, (4.58) 
L] 


Eqs. (4.57) and (4.46) reduce to difference equations in standard form 


x(k + 1) = x(k) + Pu(k) + Pw), 
y(k) = Hx(&), (4.59) 


where we include the effect of an impulsive or piecewise constant disturbance, 
w, and assume that J = O in this case. If w is a constant, then I" is given by 
Eq. (4.58) with G replaced by G,. If w is an impulse, then T, = G,." The ® 
series expansion 

pu^ ger 


+ 


= BET .x 
=e" =I+FT + 2 31 


Jut 


12 If w(r) varies significantly between its sample values, then an integral like that of Eq. (4.56) is required 
to describe its influence on x(k + 1). Random disturbances are treated in Chapter 9. 


Figure 4.17 

Program logic to 
compute ® and F from 
F,G, and T for simple 
cases. (The left arrow «— 
is read as "is replaced 
by.") 
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can also be written 


p-I-FTy, (4.60) 


where FT PT 
Teea g 
The F integral in Eq. (4.58) can be evaluated term by term to give 


eo F* Tit 


= YTG. (4.61) 


We evaluate W by a series in the form 


FT FT FT FT 
abo Les IB e ES 4.62 
Me aes ee) a — em 


which has better numerical properties than the direct series of powers. We then 
find I from Eq. (4.61) and ® from Eq. (4.60). A discussion of the selection of N 
and a technique to compute V for comparatively large T is given by Källström 
(1973), and a review of various methods is found in a classic paper by Moler 
and Van Loan (1978). The program logic for computation of and T' for simple 
cases is given in Fig. 4.17. MATLAB's c2d.m and all control design packages that 
we know of compute ® and F from the continuous matrices F, G, and the sample 
period T. 

To compare this method of representing the plant with the discrete transfer 
functions, we can take the z-transform of Eq. (4.59) with w = 0 and obtain 


[z1 — JX = l'U(z), 
Y(z) 2 HX(z) (4.63) 


Select sampling period T and description matrices F and G. 
Matrix I <+- Identity 

Matrix V <I 

k <- 11 [We are using N = 11 in Eq. (4.62).] 

If k = 1, goto step 9. 

Matrix W «— I + Ew 

k<-k+1 

Go to step 5. 

Matrix IT +- TWG 

Matrix ® +- I+ FTW 


O9 ot RUM 


— 
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therefore 


¥@) Hzr- orr. 


UG) (4.64) 


€ Example 4.11 


® and T Calculation 


By hand, calculate the «b and I" matrices for the satellite attitude-control system of Example 
4.9. 


Solution. Use Eqs. (4.60) and (4.61) and the values for F and G defined in Eq. (4.47). Since 
F? = 0 in this case, we have 
®=14+FT+ = Te 
[eee aaa £] 
r= LEER T "x | 


[se spell -U?] 


hence, using Eq. (4.64), we obtain 


which is the same result that would be obtained using Eq. (4.41) and the z-transform tables. 
Note that the values for ® and I" could have been obtained for a specific value of T by the 
MATLAB statements 


sysC = ss(F,G,H,J) 
sysD = c2d(sysC,T,zoh) 
[phi,gam,h,J] = ssdata(sysD) 


* 


Note that to compute Y/U we find that the denominator is the determi- 
nant det(zI — ®), which comes from the matrix inverse in Eq. (4.64). This 
determinant is the characteristic polynomial of the transfer function, and the 
zeros of the determinant are the poles of the plant. We have two polesat z — 1 
in this case, corresponding to the two integrations in this plant's equations of 
motion. 
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We can explore further the question of poles and zeros and the state-space 
description by considering again the transform equations (4.63). An interpretation 
of transfer-function poles from the perspective of the corresponding difference 
equation is that a pole is a value of z such that the equation has a nontrivial 
solution when the forcing input is zero. From Eq. (4.63a), this implies that the 
linear eigenvalue equations 


[zI — $]X(z) = [0] 


have a nontrivial solution. From matrix algebra the well-known requirement 
for this is that det(zI — d») = 0. Using the ® from the previous example, we 


have 
dezt- ©} = aet|| g EE a 


E ze iB 
= aer | 0 al 


=(z— 1) = 0, 


which is the characteristic equation, as we have seen. To compute the poles 
numerically when the matrices are given, one would use an eigenvalue routine. 
In MATLAB, the statement 


lam=eig(phi) 


will produce a vector, lam, of the poles of «b. 

Along the same line of reasoning, a system zero is a value of z such that the 
system output is zero even with a nonzero state-and-input combination. Thus if 
we are able to find a nontrivial solution for X(z,) and U(z,) such that Y (z,) is 
zero, then z, is a zero of the system. Combining the two parts of Eq. (4.59), we 
must satisfy the requirement 


zl-¢% -T XD T 
l H 0 l | Ul) | = [0]. (4.65) 


Once more the condition for the existence of nontrivial solutions is that the 
determinant of the square coefficient system matrix be zero." For the satellite 
example, we have 


2-4 =F TUS 2 
dt} 0 z-1 -T =1- det] =e E d 
1 p 0 á 


13 We do not consider here the case of different numbers of inputs and outputs. 
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4.3.4 


ue "E 
lit 1^ 


T? 
= + — 1). 
t CEN 


Thus we have a single zero at z = —1, as we have seen from the transfer func- 
tion. These zeros are called transmission zeros and are easily computed using 
MATLAB's tzero.m.'* Using the discrete model sysD found in Example 4.11 the 
statement 


zer=tzero(sysD) 


produces the transmission zeros in the quantity zer. 


*State-Space Models for Systems with Delay 


Thus far we have discussed the calculation of discrete state models from con- 
tinuous, ordinary differential equations of motion. Now we present the formulas 
for including a time delay in the model and also a time prediction up to one 
period which corresponds to the modified z-transform as defined by Jury. We 
begin with a state-variable model that includes a delay in control action. The state 
equations are 


x(t) = Fx(t) + Gu(t — A), (4.66) 
y = Hx. 


The general solution to Eq. (4.66) is given by Eq. (4.55); it is 


x(t) 2 e" ?x(t) + ji god Adr. 
% 


If we let t, = KT and t = kT + T, then 


AT4T 
X(kT + T) = eF'x(kT) + i eF ETT NG u(t — A) dt. 
AT 
If we substitute n = kT + T — r for t in the integral, we find a modification of 
Eq. (4.57) 
i 0 
X(kT + T) = e"x(kT) «f e™Gu(kT + T — à — n)(—dn) 


T 


T 
= XT) «f e"Gu(kT +T — à — 9) d. 
0 


14 In using this function, one must be careful to account properly for the zeros that are at infinity; the 
function might return them as very large numbers that the user must remove to "uncover" the finite 
zeros; that is, to scale the finite numbers so they don’t appear to be zero by the computer. 


Figure 4.18 

Sketch of a piecewise 
input and time axis for a 
system with time delay 
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If we now separate the system delay A into an integral number of sampling periods 
plus a fraction, we can define an integer £ and a positive number m less than one 
such that 


À —t£T —mT, (4.67) 


and 
Ces 0, 


0 zm <1. 


With this substitution, we find that the discrete system is described by 
T 
x(kT + T) = e" x(kT) + f e"Gu(kT +T - £T +mT —n)dn. (4.68) 
0 


If we sketch a segment of the time axis near t = kT — £T (Fig. 4.18), the nature 
of the integral in Eq. (4.68) with respect to the variable n will become clear. The 
integral runs for n from O to T, which corresponds to t from kT — £T + T +mT 
backward to kT — £T + mT. Over this period, the control, which we assume is 
piecewise constant, takes on first the value u(kT — £T + T) and then the value 
u(kT — £T). Therefore, we can break the integral in (2.66) into two parts as 
follows 


mT 
x(kT + r)- exer) + f e™Gdnu(kT— ET +T) 
0 


F 
: + e""Gdgu(kT — £T) 
mT 


= @x(kT) + ',u(kT — £T) + V,u(kT — £T +T). (4.69) 
In Eq. (4.69) we defined 


T. mT 
pz". T, =i e™Gdn, and T, -[ e" G dn. (4.70) 
m 0 


T 


u(t) 


AT-2T  kT-$9T*T kT-T+2T t 
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To complete our analysis it is necessary to express Eq. (4.69) in standard state- 
space form. To do this we must consider separately the cases of £ = 0, £ = 1, 
and £ > 1. 

For £ = 0, A = —mT according to Eq. (4.67), which implies not delay but 
prediction. Because mT is restricted to be less than T, however, the output will 
not show a sample before k = 0, and the discrete system will be causal. The result 
is that the discrete system computed with £ = 0, m zz 0 will show the response 
at t = 0, which the same system with £ = 0, m = 0 would show at t = mT. In 
other words, by taking £ = 0 and m = 0 we pick up the response values between 
the normal sampling instants. In z-transform theory, the transform of the system 
with £ = 0, m = 0 is called the modified z-transform.' The state-variable form 
requires that we evaluate the integrals in Eq. (4.70). To do so we first convert I’, 
to a form similar to the integral for I',. From Eq. (4.70) we factor out the constant 
matrix G to obtain 


T 
I =j e™dyG. 


T 
If we set o = n — mT in this integral, we have 


T-mT 
y = f ef Tt dg G 
[U 


T-mT 
= ef i e" do G. (4.71) 
0 


For notational purposes we will define, for any positive nonzero scalar number, 
a, the two matrices 


(a) =e", W(a)— | e™ do. (4.72) 
0 


In terms of these matrices, we have 
r-(T- mT)B(mT)V, 
r, =mTÝ. (4.73) 


The definitions in Eqs. (4.72) are also useful from a computational point of view. 
If we recall the series definition of the matrix exponential 


then we get 


€ 
Lun 
S 
— 
Il 


ale Ale 


2 p* at 
Dy k! k -1 


k=0 


15 See Jury (1964) or Ogata (1987). 
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zi men (4.74) 


If we let k = j + 1 in the sum, then, as in Eq. (4.60), we have 
oo Fit! git! 


bhis —— 
un 2. G+)! 


9 Fai 
= F 
um Gap 


j=0 


=I+aW(a)F. (4.75) 


The point of Eq. (4.75) is that only the series for W(a) needs to be computed and 
from this single sum we can compute ® and F. 
If we return to the case £ = 0, m = 0, the discrete state equations are 


x(k +1) = x(k) +T u(k) + Fuk + 1), 


where I’, and T, are given by Eq. (4.73). In order to put these equations in 
state-variable form, we must eliminate the term in u(k + 1). To do this, we define 
a new state, E(k) = x(k) — l';u(k). Then the equations are 


E(k +1) 2 x(k +1) —T,u(k +1) 
= @x(k) - P,u(k) + P,u(k + 1) - P4u(k +1), 
E(k + 1) = P[k) + D,u(k)] + Tuk) 
= (k) + (T, + )u(k) 
= (k) +Tu(k). (4.76) 


The output equation is 


y(k) = Hx(k) 
= H{ġ(k) + F,u(k)] 
= H&(k) + HI',u(k) 
= H,&(k) + Jau (k). (4.77) 
Thus for £ = 0, the state equations are given by Eqs. (4.73), (4.76), and (4.77). 
Note especially that if m = 0, then I, = 0, and these equations reduce to the 


previous model with no delay. 
Our next case is £ = 1. From Eq. (4.69), the equations are given by 


x(k + 1) = x(k) +P yuk — 1) + T,u(k). 
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In this case, we must eliminate u (k — 1) from the right-hand side, which we do 
by defining a new state x, , (K) = u(k — 1). We have thus an increased dimension 
of the state, and the equations are 


x(k 4- 1) mE" x(k) E; 
Pa 1) | > | OO | Ee t [t E d 
yk) 2[H aes IE (4.78) 
nti 
For our final case, we consider £ > 1. In this case, the equations are 


x(k + 1) = x(k) c 'u(k — 10) + Tuk — £4 10) 


and we must eliminate the past controls up to u(k). To do this we introduce £ 
new variables such that 


ik) = ulk- 8), x(k) = Uk €T, s xU) = uk - D. 


X 


The structure of the equations is 


o L Ij, 22 0 x(k) 0 
x(k + 1) 0 0 3 0 aes 0 E Tous 0 
Xx, T) Q 44 O 19 | Panos 0 
X, 4G + 1) =e ee ak | x, 4) +] o | uk) 
x(k +1) OF <0 0) 10, Or al : 
z 0 0 o 0 ode l 
x(k) 
Xp 00) 
yxkb=[Ħ 0- 0]| (4.79) 
x Qu) 
This final solution is easily visualized in terms of a block diagram, as shown 


in Fig. 4.19. 


4.3.5 *Numerical Considerations in Computing ® 
and I 


The numerical considerations of these computations are centered in the approxi- 
mation to the infinite sum for W given by Eq. (4.74) or, fora = T, by Eq. (4.62). 
The problem is that if FT is large, then (FT)"/N! becomes extremely large be- 
fore it becomes small, and before acceptable accuracy is realized most computer 
number representations will overflow, destroying the value of the computation. 
Källström (1973) has analyzed a technique used by Kalman and Englar (1966), 
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Figure 4.19 
Block diagram of system with delay of more than one period. Double line indicates 
vector valued variables 


which has been found effective by Moler and Van Loan (1978). The basic idea 
comes from Eq. (4.52) with t, — t, = 2T and t, — t, = T, namely 


(eT? = m efTe = e, (4.80) 


Thus, if T is too large, we can compute the series for T/2 and square the result. 
If T/2 is too large, we compute the series for 7/4, and so on, until we find a k 
such that T/2* is not too large. We need a test for deciding on the value of k. We 
propose to approximate the series for W, which can be written 


LIETA T/A 
=W ; 
v(z JE M emn TES AR: p 


We will select k, the factor that decides how much the sample period is divided 
down, to yield a small remainder term R. Kállstróm suggests that we estimate 
the size of R by the size of the first term ignored in V, namely, 


R= (FT)"/(N + 1)!2™. 


A simpler method is to select k such that the size of FT divided by 2* is less 
than 1. In this case, the series for FT /2* will surely converge. The rule is to select 
k such that 


2* > ||FT || 2 max F. |T. 
IIET || a 


Taking the log of both sides, we find 
k > log, || FT ||, 


from which we select 


k = max([ log, | |FT ||, 0), (4.81) 
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Figure 4.20 
Logic for a program to 
compute W using 


automatic time scaling 


where the symbol [x means the smallest integer greater than x. The maximum 
of this integer and zero is taken because it is possible that || FT || is already so 
small that its log is negative, in which case we want to select k = 0... 

. Having selected k, we now have the problem of computing W(7) from 
Yr (T/2*). Our original concept was based on the series for ®, which satisfied 
Eq. (4.80). To obtain the suitable formula for W, we use the relation between ® 
and W given by Eq. (4.60) as follows to obtain the “doubling” formula for W 


$QT) = B(T)®(T), 
I - 27FWQT) = [I + TFW(T)JI + TFW(T)] 
—IJ42TFW(T) 4- T?F^w?(T); 


therefore 
2TFWQT) = 2TFW(T) + PFW (T). 


This is equivalent to 
TF 
W(2T) = (1 + zw) v (T), 


which is the form to be used. The program logic for computing W is shown in 
Fig. 4.20.6 This algorithm does not include the delay discussed in Section 4.3.4. 


Select F and 7. 

Comment: Compute || FT ||. 

Ve max,{2, | Fl) xT 

k < smallest nonnegative integer greater than log, V. 
Comment: compute W(7/2"). 

T, — TJ? 

I < Identity 

wel 

j<ll 

If j = 1, go to step 14. 

ll Ww eI-5w 

12. je j-1 

13. Goto step 10. 

14. Comment: Now double W k times. 
15. Ifk =O, stop. 

16. We (I+ Z5 w)v 

17. k«—k-1 

18. Goto step 15. 


CASO" 90 cJ Oy Un, e Ne 


16 Similar logic is used by MATLAB in C2D.M. 
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For that, we must implement the logic shown in Fig. 4.19. In the Control Toolbox, 
the function c2d.m executes the logic with a delay if one is specified. 


*Nonlinear Models 


Contrary to the predominant developments in this book, models of dynamic sys- 
tems are generally nonlinear. However, it is more difficult to apply analysis to 
nonlinear models and, thus, less insight is gained if models are left in their nonlin- 
ear form throughout the entire design process. Controls engineers commonly use 
numerical simulation of nonlinear models to evaluate the performance of control 
systems, a technique that should always be a part of any control system design. 
To aid in the design synthesis of controllers and to gain insight into approximate 
behavior, it is often advantageous to linearize the system so the methods in this 
text can be utilized. 
We begin with the assumption that our plant dynamics are adequately de- 
scribed by a set of ordinary differential equations in state-variable form as 
x = fion, P ELT Ir Mes t), 
x, = Js A RU CER TRA t), 


x, m | arate 
Yi =h p” X 


, 7n? 


LLENO TTE X vlr vt) (4.82) 


or, more compactly in matrix notation, we assume that our plant dynamics are 
described by 


x = f(x, u, 2), 
x(t,) = Xp, 
y = h(x, u, ¢). (4.83) 


One proceeds as follows with the process of linearization and small-signal 
approximations. We assume stationarity by the approximation that f and h do not 
change significantly from their initial values at tj. Thus we can set 

X = f(x, u, to) 


or, simply 


x = f(x, u), y = h(x, u). (4.84) 
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The assumption of small signals can be reflected by taking x and u to be always 
close to their reference values X, u,, and these values, furthermore, to be an 
equilibrium point of Eq. (4.82), where 


f(x,, u) = 0. (4.85) 


Now, if x and u are “close” to x, and u,, they can be written as x = x, + ôx; u = 
u, + du, and these can be substituted into Eq. (4.84). The fact that ôx and ôu are 
small is now used to motivate an expansion of Eq. (4.84) about x, and u, and to 
suggest that the only terms in the first power of the small quantities ôx and ôu 
need to be retained. We thus have a vector equation and need the expansion of a 
vector-valued function of a vector variable, 


d 
PTAA, + 6x) = f(x, + ôx, uy + óu). (4.86) 


If we go back to Eq. (4.82) and do the expansion of the components f, one at a 
time, it is tedious but simple to verify that Eq. (4.86) can be written as" 


ôx = f(x, uy) +f, (Xs, ujJóx + f., (Xp, uôu t. (4.87) 


where we define the partial derivative of a scalar f, with respect to the vector x 


by a subscript notation: 

za f Of, af, 

=(— --- —]. 4.88 

fix 6 x, no 

The row vector in Eq. (4.88) is called the gradient of the scalar f, with respect 
to the vector x. If f is a vector, we define its partial derivatives with respect to the 
vector x as the matrix (called the Jacobean) composed of rows of gradients. In 
the subscript notation, if we mean to take the partial of all components, we omit 
the specific subscript such as 1 or 2 but hold its place by the use of a comma 


af, 
Ox 
eu 
famol 70x; |a (4.89) 


Now, to return to Eq. (4.87), we note that by Eq. (4.85) we chose x,, u, to be an 
equilibrium point, so the first term on the right of Eq. (4.87) is zero, and because 
the terms beyond those shown depend on higher powers of the small signals 6x 


and óu, we are led to the approximation 


8x VE, (Xp, Uy)Ox +F,,, (Xp, Uy)dU, 
dy = h,, 6x + h, du. (4.90) 


17 Note that dx,/d! = 0 because our “reference trajectory” x, is a constant here. 
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But now the notation is overly clumsy, so we drop the ôx, ôu and ôy notation and 
simply call them x, u and y and define the constant matrices 

F == f Xo u), G = ie Oh up); 

H = h, (x, uj), J — h,, (Xp uj). 


This results in the form we used earlier in Section 2.1.1 
x = Fx + Gu, y = Hx + Ju. (4.91) 


We go even further and restrict ourselves to the case of single input and single 
output and discrete time. We then write the model as 


x(k + 1) = x(k) + Fu(&), 
y(k) = Hx(&) + Ju(k), (4.92) 


from which the transfer function is 
G(z) = Y(z)/U(z) = H(z- 9) T + J. (4.93) 


Thus we see that nonlinear models can be approximated as linear state-space 
models or as transfer function models. The accuracy of the approximation varies 
with the problem, but is generally useful in designing the control system. The final 
design of the control system should always be checked via numerical simulation 
of the nonlinear equations. 


Signal Analysis and Dynamic Response 


In Section 4.2 we demonstrated that if two variables are related by a linear constant 
difference equation, then the ratio of the z-transform of the output signal to that 
of the input is a function of the system equation alone, and the ratio is called 
the transfer function. A method for study of linear constant discrete systems is 
thereby indicated, consisting of the following steps: 


. Compute the transfer function of the system H (z). 

. Compute the transform of the input signal, E(z). 

. Form the product, E(z) A(z), which is the transform of the output signal, U. 
. Invert the transform to obtain u(kT). 


Aa UN = 


If the system description is available in difference-equation form, and if the 
input signal is elementary, then the first three steps of this process require very 
little effort or computation. The final step, however, is tedious if done by hand, 
and, because we will later be preoccupied with design of transfer functions to 
give desirable responses, we attach great benefit to gaining intuition for the kind 
of response to be expected from a transform without actually inverting it or nu- 
merically evaluating the response. Our approach to this problem is to present a 
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4.4.1 


4.4.2 


repertoire of elementary signals with known features and to learn their represen- 
tation in the transform or z-domain. Thus, when given an unknown transform, 
we will be able, by reference to these known solutions, to infer the major fea- 
tures of the time-domain signal and thus to determine whether the unknown is 
of sufficient interest to warrant the effort of detailed time-response computation. 
To begin this process of attaching a connection between the time domain and the 
z-transform domain, we compute the transforms of a few elementary signals. 


The Unit Pulse 


We have already seen that the unit pulse is defined by ® 


e(k)=1 (k=0) 


=0 (k = 0) 
= 5,3 
therefore we have 
E\2)= D018, 2% =2= 1, (4.94) 


This result is much like the continuous case, wherein the Laplace transform of 
the unit impulse is the constant 1.0. 

The quantity E,(z) gives us an instantaneous method to relate signals to 
systems: To characterize the system H(z), consider the signal u(k), which is the 
unit pulse response; then U(z) = H(z). 


The Unit Step 


Consider the unit step function defined by 
€,(k) = 1 (k 2 0) 


=0 (k « 0) 
= 1(k). 
In this case, the z-transform is 
Bil 3; boh y an 
k=—00 k=0 
e. (z « 
Tg m : 
ees: (le D): (4.95) 
z—1 


18 We have shifted notation here to use e(k) rather than e, for the kth sample. We use subscripts to 
identify different signals. 


Figure 4.21 


(a) Pole and zero of E, (2) 
in the z-plane. The unit 
circle is shown for 
reference. (b) Plot of 


e,(k) 


4.4.3 
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Here the transform is characterized by a zero at z = 0 and a pole at z = 1. The 
significance of the convergence being restricted to |z| > 1 will be explored later 
when we consider the inverse transform operation. The Laplace transform of the 
unit step is 1/5; we may thus keep in mind that a pole at s = 0 for a continuous 
signal corresponds in some way to a pole at z — 1 for discrete signals. We will 
explore this further later. In any event, we record that a pole at z = 1 with 
convergence outside the unit circle, |z| = 1, will correspond to a constant for 
positive time and zero for negative time. 

To emphasize the connection between the time domain and the z-plane, we 
sketch in Fig. 4.21 the z-plane with the unit circle shown and the pole of E,(z) 
marked x and the zero marked o. Beside the z-plane, we sketch the time plot of 
é,(k). 


(a) (b) 


Exponential 
The one-sided exponential in time is 
e (k) on (k > 0) 
=0 (k<0) (4.96) 


which is the same as 7*1(k), using the symbol 1(k) for the unit step function. 
Now we get 


oo 
E,(z) = 5 ort 
k=0 


aF ra 
k=0 
l 


]-rz' 
Z 


ge 


(Irz^'| « 1) 


(lzl > Irl). (4.97) 
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Figure 4.22 


(a) Pole and zero of £,(z) 
in the z-plane. (b) Plot of 


ey) 


4.4.4 


The pole of E,(z) is at z = r. From Eq. (4.96) we know that e,(k) grows 
without bound if |r| > 1. From Eq. (4.97) we conclude that a z-transform that 
converges for large z and has a real pole outside the circle |z| — 1 corresponds 
to a growing signal. If such a signal were the unit-pulse response of our system, 
such as our digital control program, we would say the program was unsteble as 
we saw in Eq. (4.39). We plot in Fig. 4.22 the z-plane and the corresponding time 
history of E,(z) as e,(k) for the stable value, r = 0.6. 


(a) (b) 


General Sinusoid 


Our next example considers the modulated sinusoid e,(k) = [r* cos(k8)]1(K), 
where we assume r > 0. Actually, we can decompose e,(k) into the sum of two 
complex exponentials as 


Jk6 —jk6 
e, =r! (== ) 1(k), 


and because the z-transform is linear, we need only compute the transform of 
each single complex exponential and add the results later. We thus take first 


e,(k) = r*e"^1(k) (4.98) 


and compute 


19 We have not shown this formally. The demonstration, using the definition of linearity given above, is 
simple and is given in Section 4.6. 


Figure 4.23 
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l 


Z 
= ey (\z| > r). (4.99) 


The signal e,(k) grows without bound as k gets large if and only if r > 1, and 
a system with this pulse response is BIBO stable if and only if |r| < 1. The 
boundary of stability is the unit circle. To complete the argument given before 
for e,(k) = r* cos kô 1(k), we see immediately that the other half is found by 
replacing 0 by —0 in Eq. (4.99) 


Z 


Z{rkeJ* 1(k)} =—_. (izl >r), (4.100) 
z—re? 
and thus that 
: = 1 z z 
Ez) = 3 P mper deem a) 
z(z — r cosĝ) E 
22 — 9r(cos0)z +r? dn. (4.101) 


The z-plane pole-zero pattern of E,(z) and the time plot of e,(k) are shown in 
Fig. 4.23 for r — 0.7 and0 — 45*. 

We note in passing that if 0 = 0, then e, reduces to e, and, with r = 1, toe,,so 
that three of our signals are special cases of e,. By exploiting the features of E,(z), 
we can draw a number of conclusions about the relation between pole locations 
in the z-plane and the time-domain signals to which the poles correspond. We 
collect these for later reference. 


(a) Poles and zeros of E,(z) for 0 = 45°, r = 0.7 in the z-plane. (b) Plot of e,(K) 


(a) 
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1. The settling time of a transient, defined as the time required for the signal to 
decay to one percent of its maximum value, is set mainly by the value of the 
radius, r, of the poles. 


(a) r > 1 corresponds to a growing signal that will not decay at all. 


(b) r = 1 corresponds to a signal with constant amplitude (which is not 
BIBO stable as a pulse response). 


(c) For r « 1, the closer r is to 0 the shorter the settling time. The corre- 
sponding system is BIBO stable. We can compute the settling time in 
samples, X, in terms of the pole radius, r. 


Pole Radius Response Duration 


r N 
0.9 43 
0.8 21 
0.6 9 
0.4 5 


(d) A pole at r = 0 corresponds to a transient of finite duration. 


2. The number of samples per oscillation of a sinusoidal signal is determined 
by 0. If we require cos(0k) = cos(0(k + N)), we find that a period of 2x 
rad contains N samples, where 


2x 360 
N2—|,2— samples/cycle. 
9 Lai We ples/cy 


For 0 = 45°, we have N = 8, and the plot of e,(k) given in Fig. 4.23(b) 
shows the eight samples in the first cycle very clearly. A sketch of the unit 
circle with several points corresponding to various numbers of samples per 
cycle marked is drawn in Fig. 4.24 along with other contours that will be 
explained in the next section. The sampling frequency in Hertz is 1/7, and 
the signal frequency is f = 1/NT so that N = f,/f and 1/N is a normalized 
signal frequency. Since 0 = (2x)/N, 0 is the normalized signal frequency 
in radians/sample. 0/T is the frequency in radians/second. 


A compilation of signal responses versus their pole location in the z-plane is 
shown in Fig. 4.25. It demonstrates visually the features just summarized for the 
general sinusoid, which encompasses all possible signals. 


Figure 4.24 
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Sketch of the unit circle with angle marked in numbers of samples per cycle 


NO 


N=2 “OR 


-1.0 -0.6 


FTL Y 
||| [OPER Rs 
a le ee a A NB 


~ 


-0.4 


z = plane loci of roots of constant f and w, 


s= t V LT E 


zcmeh 
T = sampling period 


4.4.5 


Correspondence with Continuous Signals 


From the calculation of these few z-transforms, we have established that the 
duration of a time signal is related to the radius of the pole locations and the 
number of samples per cycle is related to the angle, 0. Another set of very useful 
relationships can be established by considering the signals to be samples from a 
continuous signal, e(t), with Laplace transform E(s). With this device we can 
exploit our knowledge of s-plane features by transferring them to equivalent z- 
plane properties. For the specific numbers represented in the illustration of e,, we 
take the continuous signal 
y(t) =e" cos bt 1(t) (4.102) 

with 

aT = 0.3567, 

bT = 7/4. 
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Figure 4.25 
Time sequences associated with pole locations in the z-plane 


Im(z) 


7^ / 0s Fi \ X 1 Re(z) 


And, taking samples one second apart (T — 1), we have 
k 
YKT) = (e-9959* cos Hk) 
k 
= (0.7) cos PIU) 
= eK). 
The poles of the Laplace transform of y(t) (in the s-plane) are at 


5,,— —a c jb, —a — jb. 


Figure 4.26 
Corresponding lines in 
the s-plane and the 
z-plane according to 
l= ev 
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From Eq. (4.101), the z-transform of E,(z) has poles at 
ijs npn ene do 
but because y (k T) equals e, (X), it follows that 


r=“, 0-—bT 


If E(z) is a ratio of polynomials in z, which will be the case if e(k) is generated 
by a linear difference equation with constant coefficients, then by partial fraction 
expansion, E(z) can be expressed as a sum of elementary terms like E, and ad 
In all such cases, the discrete signal can be generated by samples from continuous 
signals where the relation between the s-plane poles and the corresponding z- 
plane poles is given by 


z=". (4.103) 


If we know what it means to have a pole in a certain place in the s-plane, then 
Eq. (4.103) shows us where to look in the z-plane to find a representation of 
discrete samples having the same time features. It is useful to sketch several 
major features from the s-plane to the z-plane according to Eq. (4.103) to help 
fix these ideas. Such a sketch is shown in Fig. 4.26. 

Each feature should be traced in the mind to obtain a good grasp of the 
relation. These features are given in Table 4.2. We note in passing that the map 


s-plane 2-plane 


(a) (b) 


20 Unless a pole of E(z) is repeated. We have yet to compute the discrete version of a signal corresponding 
to a higher-order pole. The result is readily shown to be a polynomial in k multiplying r*e/*’. 
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Table 4.2 


Description of corresponding lines in s-plane and z-plane 


4.4.6 


s-plane Symbol z-plane 
S — jo x xXx jz] 2 1 
Real frequency axis Unit circle 
s—-a20 DNOO z=r21 
s=a <0 OOO z=r,0<er<1 
s= —£o, t jo 1-4 AAA z «re wherer = exp(-£o,T) 
=-a +jb = ear, 
Constant damping ratio / 2 
=w T, = 
if ¢ is fixed and w ? On Y y bT 
: n" Logarithmic spiral 
varies 
s=4t(x/T) 4 o, a <0 z=-r 


z = e of Eq. (4.103) is many-to-one. There are many values of s for each value 
of z. In fact, if 


2x 
$,—5, TU uei, 


then eh? = e*? , The (great) significance of this fact will be explored in Chapter 5. 

Lines of constant damping in the s-plane are mapped into the z-plane ac- 
cording to Eq. (4.103) for several values of ¢ in Fig. 4.24. We often refer to the 
damping of a pole in the z-plane in terms of this equivalent s-plane damping, or 
sometimes we simply refer to the damping of a z-plane pole. Likewise, lines of 
constant natural frequency, c, , in the s-plane (semi-circles centered at the origin) 
are also mapped into the z-plane according to Eq. (4.103) for several values of w, 
in Fig. 4.24. It's interesting to note that in the immediate vicinity of z = +1, the 
map of Z and c, looks exactly like the s-plane in the vicinity of s = 0. Because 
of the usefulness of this mapping, the Control System Toolbox has the function 
zgrid.m that allows one to superimpose this mapping on various plots to help in 
the interpretation of the results. You will see its use in the figure files of discrete 
root loci in Chapter 7. 


Step Response 


Our eventual purpose, of course, is to design digital controls, and our interest in the 
relation between z-plane poles and zeros and time-domain response comes from 
our need to know how a proposed design will respond in a given dynamic situation. 
The generic dynamic test for controls is the step response, and we will conclude 
this discussion of discrete system dynamic response with an examination of 
the relationships between the pole-zero patterns of elementary systems and the 
corresponding step responses for a discrete transfer function from u to y of a 
hypothetical plant. Our attention will be restricted to the step responses of the 
discrete system shown in Fig. 4.27 for a selected set of values of the parameters. 


Figure 4.27 
Definition of the 
parameters of the 
system whose step 
responses are to be 
catalogued 


Figure 4.28 

Pole-zero pattern of Y(z) 
for the system of Fig. 
4.27, with z, = 

Py Ulz) 2 z/(G — 1), a, 
and a, selected for 

0 = 18°, and¢ =0.5 
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(z ~2,)@—24) 


(z — py 2? ~a; z+a,) 


Note that if z, = p,, the members of the one pole-zero pair cancel out; 
and if at the same time z, = r cos(@), a = —2r cos(0), and a, = r?, the system 
response, Y (z), to the input with transform U(z) = 1 (a unit pulse) is 


z—rcos0 


Y(z) — (4.104) 


z!-2rcos0z +r? 
This transform, when compared with the transform E,(z) given in Eq. 
(4.101), is seen to be 


Y(z) 2 z E); 


and we conclude that under these circumstances the system pulse response is a 
delayed version of e, (&), a typical second-order system pulse response. 

For our first study we consider the effect of zero location. We let z, — p, 
and explore the effect of the (remaining) zero location, z,, on the step-response 
overshoot for three sets of values of a, and a,. We select a, and a, so that the 
poles of the system correspond to a response with an equivalent s-plane damping 
ratio ¢ = 0.5 and consider values of 0 of 18, 45, and 72 degrees. In every case, 
we will take the gain K to be such that the steady-state output value equals the 
step size. The situation in the z-plane is sketched in Fig. 4.28 for 0 = 18°. The 
curve for ¢ = 0.5 is also shown for reference. In addition to the two poles and 
one zero of H(z), we show the pole at z = | and the zero at z = 0, which come 
from the transform of the input step, U (z), given by z/(z — 1). 
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Figure 4.29 

Plot of step responses 
for a discrete plant 
described by the 
pole-zero pattern of 
Fig. 4.28 for various 
values of z, 


The major effect of the zero z, on the step response y(k) is to change the 
percent overshoot, as can be seen from the four step responses for this case plotted 
in Fig. 4.29. To summarize all these data, we plot the percent overshoot versus 
zero location in Fig. 4.30 for ¢ = 0.5 and in Fig. 4.31. for  — 0.707. The major 
feature of these plots is that the zero has very little influence when on the negative 
axis, but its influence is dramatic as it comes near +1. Also included on the plots 
of Fig. 4.30 are overshoot figures for a zero in the unstable region on the positive 
real axis. These responses go in the negative direction at first, and for the zero 
very near +1, the negative peak is larger than 1?! 

Our second class of step responses corresponds to a study of the influence of 
a third pole on a basically second-order response. For this case we again consider 
the system of Fig. 4.27, but this time we fix z, — z, — —1 and let p, vary from 
near —1 to near +1. In this case, the major influence of the moving singularity 
is on the rise time of the step response. We plot this effect for 0 — 18, 45, and 
72 degrees and ¢ = 0.5 on Fig. 4.32. In the figure we defined the rise time as the 
time required for the response to rise to 0.95, which is to 5% of its final value. 
We see here that the extra pole causes the rise time to get very much longer as 
the location of p, moves toward z = +1 and comes to dominate the response. 

Our conclusions from these plots are that the addition of a pole or zero to a 
given system has only a small effect if the added singularities are in the range from 
0 to —1. However, a zero moving toward z = +1 greatly increases the system 
overshoot. A pole placed toward z = +1 causes the response to slow down and 
thus primarily affects the rise time, which is progressively increased. 

The understanding of how poles and zeros affect the time response is very 
useful for the control system designer. The knowledge helps guide the iterative 


yk) = 02, =0.9 
Bo da oz, 70.8 

g x Z= 0.7 

o E ez, 7 06 


21 Such systems are called nonminimum phase by Bode because the phase shift they impart to a 
sinusoidal input is greater than the phase of a system whose magnitude response is the same but that 
has a zero in the stable rather than the unstable region. 


Figure 4.30 

Effects of an extra zero 
on a discrete 
second-order system, 
er—p5sgif8 55. 
and 72° 


4.5 
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design process and helps the designer understand why a response is the way it 
is. Ultimately, however, the test of a design is typically the actual time response, 
either by numerical simulation or an experimental evaluation. Today, transform 
inversion would never be carried out. In MATLAB, the numerical simulation of 
the impulse response for a discrete system, sysD is accomplished by 


y = impulse(sysD) 
and the discrete step response by 
y — step(sysD) 


Invoked without a left-hand argument (y =), both functions result in a plot of the 
response on the screen. 


Frequency Response 


A very important concept in linear systems analysis is the frequency response. 
If a sinusoid at frequency w, is applied to a stable, linear, constant, continuous 
system, the response is a transient plus a sinusoidal steady state at the same 
frequency, w,, as the input. If the transfer function is written in gain-phase 
form as H(jw) = A(w)e/*), then the steady-state response to a unit-amplitude 
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Figure 4.31 

Effects of an extra zero 
on a discrete 
second-order system, 
c= 0:707:0'= 18° 45°, 
and 72°: percent 
overshoot versus zero 
location 


Figure 4.32 

Effects of an extra pole 
on rise time for a 
discrete third-order 
System, two zeros at —1, 
one zero at oo; ¢ = 

0.5: 0 2 18?, 45°, 72? 
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sinusoidal signal has amplitude A(w,) and phase W (w,) relative to the input 
signal. 

We can say almost exactly the same respecting the frequency response of a 
stable, linear, constant, discrete system. If the system has a transfer function H (z), 
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we define its magnitude and phase for z taking on values around the unit circle 
by H (e/*") = A(wT)e/¥@”. If a unit-amplitude sinusoid is applied, then in the 
steady state, the response samples will be on a sinusoid of the same frequency 
with amplitude A(w,7) and phase V (c, T). It is worthwhile going through the 
calculations to fix ideas on this point. 

From Eq. (4.16), the discrete response transform is 


Uc) = Hz) E(2). (4.105) 


If e(k) = cos(c, Tk) (k), then, from Eq. (4.101) with r = 1 and 0 = wT, we 
have 


1 z z 
z) = 4.106 
B)=5 ren (4.106) 
If we substitute Eq. (4.106) into Ea. (4.105), we obtain 
"3 l zH(z) |  zH(z) 
U(z) -3l; De jp (4.107) 


The steady state of u(k T) corresponds to the terms in the expansion of Eq. (4.107) 
associated with the two poles on the unit circle. If we expand U(z)/z into partial 
fractions and multiply back by z, the steady state part can be found as 


1 H(el*.7 l —je,T 
ins EL DUE 

55 2 Z— e” 9 g= eit 
If H (e/^^") = A (w, T)e/ "7, then we have 


A ez A ez 


Jlalz) = - —, 
Unt) 3 uu T 95 ee (4.108) 
and the inverse transform of U .(z) is 
U (kT) = Te ein + A iV p-iw,Tk 
$55 2 
= Acos(w,Tk + y), (4.109) 


which, of course, are samples at kT instants on a sinusoid of amplitude A, phase 
w, and frequency w, 

We will defer the plotting of particular frequency responses until later chap- 
ters (see, for example, Figs. 6.3, 6.8, 7.16, and 7.28). However, it should be 
noticed here that although a sinusoid of frequency w, could be passed through 
the samples of Eq. (4.109), there are other continuous sinusoids of frequency 
w, + £27 / T for integer £ which also pass through these points. This is the phe- 
nomenon of aliasing, to which we will return in Chapter 5. Here, we define the 
discrete frequency response of a transfer function H (z) to sinusoids of frequency 
c, as H (e/*»7) so that the amplitude A and phase y are 


A = |H(e/*o")| and y = Z(H (e!”")) (4.110) 
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4.5.1 


Fast Fourier Transform 


which can be evaluated and plotted by MATLAB’s bode.m with the scripts 
sysD = tf(num,den,T) 
bode(sysD) 
where amplitude is plotted in decibels (dB), or 
[mag,phase,w] = bode(sysD) 
subplot(2,1,1), loglog(w,mag) 
subplot(2,1,2), semilogx(w, phase) 


where amplitude is plotted as a ratio as in the figures in this text. If the system is 
described by the state-space matrices, the scripts above can be invoked with 


SysD = ss(F.G,H,J,T). 


*The Discrete Fourier Transform (DFT) 


The analysis developed above based on the z-transform is adequate for con- 
sidering the theoretical frequency response of a linear, constant system or the 
corresponding difference equation, but it is not the best for the analysis of real- 
time signals as they occur in the laboratory or in other experimental situations. 
For the analysis of real data, we need a transform defined over a finite data record, 
which can be computed quickly and accurately. The required formula is that of 
the Discrete Fourier Transform, the DFT, and its numerical cousin, the Fast 
Fourier Transform, the FFT. Implementation of a version of the FFT algo- 
rithm is contained in all signal-processing software and in most computer-aided 
control-design software. 

To understand the DFT, it is useful to consider two properties of a signal and 
its Fourier transform that are complements of each other: the property of being 
periodic and the property of being discrete. In ordinary Fourier analysis, we have 
a signal that is neither periodic nor discrete and its Fourier transform is also 
neither discrete nor periodic. If, however, the time function f(t) is periodic with 
period 7,, then the appropriate form of the transform is the Fourier series, and 
the transform is defined only for the discrete frequencies w = 27 n/T,. In other 
words, if the function in time is periodic, the function in frequency is discrete. 
The case where the properties are reversed is the z-transform we have just been 
studying. In this case, the time functions are discrete, being sampled, and the 
z-transform is periodic in w; for if z = e/*", corresponding to real frequencies, 
then replacing w = w+ 2x k/T leaves z unchanged. We can summarize these 
results with the following table: 
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Time Frequency 


Fourier Series periodic discrete 
z-transform discrete periodic 


Suppose we now have a time function that is both periodic and discrete. 
Based on what we have seen, we would expect the transform of this function also 
to be both periodic and discrete. And this is the case, which leads us to the finite 
discrete Fourier transform and its finite inverse. Let the time function in question 
be f(kT) = f(kT + NT). Because the function is periodic, the transform can 
be defined as the finite sum 


2xn va : 
FI—Iz kT fda akT)/(NT) | 
(Z5) 2f Je 


This is the same as the z-transform over one period evaluated at the discrete 
frequencies of a Fourier series w = 2zn/NT. It is standard practice to suppress 
all the arguments except the indices of time and frequency and write 


N-I 
R=) wie re (4.111) 
k=0 


To complete the DFT, we need the inverse transform, which, by analogy with the 
standard Fourier transform, we guess to be the sum 


N-1 
jam(nk)/N 
bg Fe A 


n=0 


If we substitute Eq. (4.111) with summing index @ into this, we find 


N-1 f N-1 
) ) fe SECUN ei” OoN, 
n= £—0 


Interchanging the order of the summations gives 


N-1 N-I 
) oh ) afi OU 
£=0 


n=0 


The sum in the braces is a finite geometric series, which we can evaluate as 
follows 


N—1 j2n(k— 
Y. cian -OUN — le 
1 — <i27(k-0/N 


n=0 
_ [ÍN  k-t-0 
0 k—£—1,2,...,N—1. 


136 Chapter4 Discrete Systems Analysis 


The sum is periodic with period N. With this evaluation, we see that the sum we 
have been considering is N f,, and thus we have the inverse sum 


D. n 
"a m(nk)/N 
h= Ly F ei ; (4.112) 


Equations (4.111) and (4.112) comprise the DFT 


N-I 
" — j2n (nk N 
E, - y peuo, 
k=0 
1 N=) 
ARN j2m(nk)/N 
fi, = N X ke : 


n=0 


Because there are N terms in the sum in Eq. (4.111), it would appear that to 
compute the DFT for one frequency it will take on the order of N multiply and 
add operations; and to compute the DFT for all N frequencies, it would take on 
the order of N? multiply and add operations. However, several authors, especially 
Cooley and Tukey (1965), have showed how to take advantage of the circular 
nature of the exponential so that all N values of F, can be computed with the 
order of N log(N) operations if N is a power of 2. For N = 1024, this is a saving 
of a factor of 100, a very large value. Their algorithm and related schemes are 
called the Fast Fourier Transform or FFT. 

To use the DFT/FFT in evaluating frequency response, consider a system 
described by Eq. (4.105) and where the input is a sinusoid at frequency c, — 
2x £/NT so that e(k T) = Asin(2zx £kT/NT). We apply this input to the system 
and wait until all transients have died away. At this time, the output is given by 
u(kT) = B sin(2xlk/N + y). The DFT of e(k) is 


N-1 
2n £k p 
2, = Sa sn (Z2) im 


1 
= $i A [GN = gH ne) eg iQrnbo/N 
2j 


N-1 
U, — ) Bsin (E 2 v) er iOnnky/N 


7 [e/* ci REIN qi g-JOREDIN] q- jenem 
J 


4.6 


4.6.1 
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zi ni. 


Dividing these results, we see that with sinusoidal input and output, the frequency 
response at the frequency w = (27 £)/ NT is given by 


H (eI) = zA 


£ 
where U, = FFT (u,) and E, = FFT (e,), each evaluated at n = £. We will 
discuss in Chapter 12 the general problem of estimation of the total frequency 
response from experimental data using the DFT/FFT as well as other tools. 


Properties of the z-Transform 


We have used the z-transform to show that linear, constant, discrete systems 
can be described by a transfer function that is the z-transform of the system’s 
unit-pulse response, and we have studied the relationship between the pole-zero 
patterns of transfer functions in the z-plane and the corresponding time responses. 
We began a table of z-transforms, and a more extensive table is given in Appendix 
B. In Section 4.6.1 we turn to consideration of some of the properties of the z- 
transform that are essential to the effective and correct use of this important tool. 
In Section 4.6.2 convergence issues concerning the z-transform are discussed and 
in Section 4.6.3 an alternate derivation of the transfer function is given. 


Essential Properties 


In order to make maximum use of a table of z-transforms, one must be able to 
use a few simple properties of the z-transform which follow directly from the 
definition. Some of these, such as linearity, we have already used without making 
a formal statement of it, and others, such as the transform of the convolution, 
we have previously derived. For reference, we will demonstrate a few properties 
here and collect them into Appendix B for future reference. In all the properties 
listed below, we assume that F(z) = Z{ f,(kT)}. 


1. Linearity: A function f(x) is linear if f (ax, + Bx,) = af(x,) + Bf (x). 
Applying this result to the definition of the z-transform, we find immediately 
that 


Z(ef KT) + BAET = Do (of 00 + 8f khe 


k=- 


= a2Z{f,(k)} + BZU GO) 
= oF, (z) + BF). 
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Thus the z-transform is a linear function. It is the linearity of the transform 
that makes the partial-fraction technique work. 


2. Convolution of Time Sequences: 


Z | ; AOAK -»| = F (2)F,(2). 


l=-20 


We have already developed this result in connection with Eq. (4.32). It is this 
result with linearity that makes the transform so useful in linear-constant- 
system analysis because the analysis of a combination of such dynamic 
systems can be done by linear algebra on the transfer functions. 


3. Time Shift: 
Z(f(k +n)} =z F(z). (4.113) 


We demonstrate this result also by direct calculation: 


ZAfk+n))= » fk + nz. 


k=-00 


If we let k +n = j, then 


Zif +n = Y fg 


j==0o 


= 2 F(z). QED 


This property is the essential tool in solving linear constant-coefficient dif- 
ference equations by transforms. We should note here that the transform of 
the time shift is not the same for the one-sided transform because a shift 
can introduce terms with negative argument which are not included in the 
one-sided transform and must be treated separately. This effect causes initial 
conditions for the difference equation to be introduced when solution is done 
with the one-sided transform. See Problem 4.13. 


4. Scaling in the z-Plane: 
Z{r™* f(k)} = F(rz). (4.114) 


By direct substitution, we obtain 


oo 


eir tog eee 


k=-00 


=> quer 


k=- 


= F(rz). QED 
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As an illustration of this property, we consider the z-transform of the unit 
step, 1 (k), which we have computed before 


26 


ZTTUONe n = 
k=0 


z—1 
By property 4 we have immediately that 


3 rz Z 

ANg) rz—1 z — (1ry 
As a more general example, if we have a polynomial a(z) = z? + az+a, 
with roots re*/®, then the scaled polynomial a?z? + a,az +a, has roots 
(r/a)e*/*. This is an example of radial projection whereby the roots of a 
polynomial can be projected radially simply by changing the coefficients 
of the polynomial. The technique is sometimes used in pole-placement de- 
signs as described in Chapter 8, and sometimes used in adaptive control as 
described in Chapter 13. 


. Final-Value Theorem: If F(z) converges for |z| > 1 and all poles of (z — 
1) F(z) are inside the unit circle, then 


lim fk) lim(z — D F(z). (4.115) 


The conditions on F(z) assure that the only possible pole of F(z) not strictly 
inside the unit circle is a simple pole at z = 1, which is removed in (z — 
1) F(z). Furthermore, the fact that F(z) converges as the magnitude of z 
gets arbitrarily large ensures that f (k) is zero for negative k. Therefore, all 
components of f (k) tend to zero as k gets large, with the possible exception 
of the constant term due to the pole at z — 1. The size of this constant is 
given by the coefficient of 1/(z — 1) in the partial-fraction expansion of 
F(z), namely 


E= lim(z — 1) F(z). 


However, because all other terms in f(k) tend to zero, the constant C is the 
final value of f(k), and Eq. (4.115) results. QED 
As an illustration of this property, we consider the signal whose transform 
is given by 
Zeal 
U(z) = ——————À 
tee OM 
Because U (z) satisfies the conditions of Eq. (4.115), we have 
z Tz+! 
AN AEE 


jz] 1. 


lim u(k) = lim(z — 1) 
k= zl 
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€ Example 4.12 


T 
=—(1+1 
rose 


-» 


This result can be checked against the closed form for u(k) given by 
Eq. (4.121) below. 


. Inversion: As with the Laplace transform, the z-transform is actually one of a 


pair of transforms that connect functions of time to functions of the complex 
variable z. The z-transform computes a function of z from a sequence in k. 
(We identify the sequence number & with time in our analysis of dynamic 
systems, but there is nothing in the transform per se that requires this.) The 
inverse z-transform is a means to compute a sequence in k from a given 
function of z. We first examine two elementary schemes for inversion of a 
given F(z) which can be used if we know beforehand that F(z) is rational in z 
and converges as z approaches infinity. For a sequence f (k), the z-transform 
has been defined as 


oo 

FO= M fü*,  m«lgpeR. (4.116) 
k—-oo 

If any value of f(k) for negative k is nonzero, then there will be a term in 
Eq. (4.116) with a positive power of z. This term will be unbounded if the 
magnitude of z is unbounded; and thus if F(z) converges as |z| approaches 
infinity, we know that f(k) is zero for k < 0. In this case, Eq. (4.116) is 

one-sided, and we can write 


Fübeo* Juss ne (4.117) 
k=0 


The right-hand side of Eq. (4.117) is a series expansion of F(z) about 
infinity or about z^! = 0. Such an expansion is especially easy if F(z) is the 
ratio of two polynomials in z~'. We need only divide the numerator by the 
denominator in the correct way, and when the division is done, the coefficient 
of z * is automatically the sequence value f (k). An example we have worked 
out before will illustrate the process. 


z-Transform Inversion by Long Division 


The system for trapezoid-rule integration has the transfer function given by Eq. (4.14) 


Tz+1 
H(z) = =—. lz| > 1. 
2z-1 


Determine the output for an input which is the geometric series represented by e,(k) with 
r = 0.5. That is 


E,(z) = -s 


0.5. 
op — > 
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Solution. The z-transform of the output is 


U(z) = E,(z) H(z) 
zd Z E 
~~ Osa zl 
Equation (4.118) represents the transform of the system output, «(k). Keeping out the factor 
of T/2, we write U (z) as aratio of polynomials in z 


[2| > 1. (4.118) 


T 1z4z'! 


dS E 
(09 351—157 4 0577 


(4.119) 


and divide as follows 


Fl 2.527! + 3.252? + 3.62527 +- 


= = 1 = aa 
1 — 1.527! + 0.527? )l+z 


1- 1.527! + 0.5272 
2.525) =05z2-° 
2.527! — 3.7527? 4 1.25273 
325277 — 1.2527? 
3.2527? — 4.81527 2? + 1.625z~4 
3.625273 — 1.625274 
3.6252? —..- 


By direct comparison with U(z) = P» u(k)z *, we conclude that 
uy = T/2, 
u, = (T/2)2.5, 
u, = (T/2)3.25, 


(4.120) 


Clearly, the use of a computer will greatly aid the speed of this process in all but the simplest 
of cases. Some may prefer to use synthetic division and omit copying over all the extraneous 
z’s in the division. The process is identical to converting F(z) to the equivalent difference 
equation and solving for the unit-pulse response. 


* 


The second special method for the inversion of z-transforms is to decompose 
F(z) by partial-fraction expansion and look up the components of the sequence 
f(k) in a previously prepared table. 


€ Example 4.13 


z-Transform Inversion by Partial Fraction Expansion 


Repeat Example 4.12 using the partial fraction expansion method. 


142 Chapter4 Discrete Systems Analysis 


Solution. Consider again Eq. (4.118) and expand U(z) as a function of z^! as follows 


TI4z! l A B 
"D= Zi COSE lec O 
We multiply both sides by 1 — z7’, let z^! = 1, and compute 
n2 

A= 2: 05 mop 

Similarly, at z^ = 2, we evaluate 
Ti+2 3T 
"£e 2 


Looking back now at e, and e,, which constitute our "table" for the moment, we can copy 
down that 


u, = Ae,(k) + Be,(k) 
3I 
= 2Te,(k) - eA) 
3T (1M 
(2r — EH (5) 1(k) 


1È 3 
=> j+- z| 1(K). (4.121) 


2 9t 
Evaluation of Eq. (4.121) fork = 0, 1, 2, .. . will, naturally, give the same values for u(k) as 
we found in Eq. (4.120). 


4.6.2 


* 


* Convergence of z-Transform 


We now examine more closely the role of the region of convergence of the 
z-transform and present the inverse-transform integral. We begin with another 
example. The sequence 


—-] k<0O 
has the transform 


-l co 
F(z) = x -z*52- be — ] 
0 


z-1 
This transform is exactly the same as the transform of the unit step 1 (k), Eq. (4.95), 


except that this transform converges inside the unit circle and the transform of the 
1(K) converges outside the unit circle. Knowledge of the region of convergence 
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is obviously essential to the proper inversion of the transform to obtain the time 
sequence. The inverse z-transform is the closed, complex integral? 


fk) = gs proc’, (4.122) 
2nj Zz 


where the contour is a circle in the region of convergence of F(z). To demonstrate 
the correctness of the integral and to use it to compute inverses, it is useful to 
apply Cauchy’s residue calculus [see Churchill and Brown (1984)]. Cauchy’s 
result is that a closed integral of a function of z which is analytic on and inside a 
closed contour C except at a finite number of isolated singularities z, is given by 


l 
sh F(z) dz = D Resla (4.123) 


In Eq. (4.123), Res(z,) means the residue of F(z) at the singularity at z,. We will 
be considering only rational functions, and these have only poles as singularities. 
If F(z) has a pole of order n at z,, then (z — z) F(z) is regular at z, and can be 
expanded in a Taylor series near z, as 

(2 —2,)"F(z) = A_, +A pg (2 — %) Hee + AqG-z)" 


=n 


+A- zD +. (4.124) 


The residue of F(z) at z, is A_,. 
First we will use Cauchy’s formula to verify Eq. (4.123). If F(z) is the 
z-transform of f(k), then we write 


1 = dz 
T= — Dz'z—. 
Aj pris Zz : 


We assume that the series for F(z) converges uniformly on the contour of inte- 
gration, so the series can be integrated term by term. Thus we have 


ETE. 4, 42 
T= F 2,10 f: = 


The argument of the integral has no pole inside the contour if k —/ > 1, and it 
has zero residue at the pole at z = Oif k —/ < 0. Only if k = l does the integral 
have a residue, and that is 1. By Eq. (4.123), the integral is zero if k = / and is 
2x j if k = l. Thus Z = f(k), which demonstrates Eq. (4.122). 


22 If it is known that f (k) is causal, that is, f (k) = O for k < 0, then the region of convergence is outside 
the smallest circle that contains all the poles of F(z) for rational transforms. It is this property that 
permits inversion by partial-fraction expansion and long division. 
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To illustrate the use of Eq. (4.123) to compute the inverse of a z-transform, 
we will use the function z/(z — 1) and consider first the case of convergence for 
|z| > 1 and second the case of convergence for |z| < 1. For the first case 


l 
f(k) = $ =e QE (4.125) 
| 


27 aree — 1° 4 


where the contour is a circle of radius greater than 1. Suppose k < 0. In this case, 
the argument of the integral has two poles inside the contour: one at z = 1 with 


residue x 


z 
lhm(z — 1)—— = 1, 
fd z—1 


and one pole at z = 0 with residue found as in (2.109)(if k < 0, then z * removes 
the pole) 
FT. -1 


eal = z—1 
—-—(lctziz"-£r?4e cre) 
The residue is thus —1 for all k, and the sum of the residues is zero, and 
fik) = 0, k « 0. (4.126) 


For k > 0, the argument of the integral in Eq. (4.125) has only the pole at 
z = | with residue 1. Thus 


fí (21, k 7 0. (4.127) 
Equations (4.123) and (4.124) correspond to the unit-step function, as they should. 
We would write the inverse transform symbolically Z~'{.} as, in this case 


a 5] = 1(k) (4.128) 


when z/(z — 1) converges for |z| > 1. 
If, on the other hand, convergence is inside the unit circle, then for k > 0, 
there are no poles of the integrand contained in the contour, and 


f,)=0, k»0 


At k < 0, there is a pole at the origin of z, and as before, the residue is equal to 
—] there, so 


fk) = -l, k « 0. 
In symbols, corresponding to Eq. (4.128), we have 


{= | - i-i 


when z/(z — 1) converges for |z| < 1. 
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Although, as we have just seen, the inverse integral can be used to compute an 
expression for a sequence to which a transform corresponds, a more effective use 
of the integral is in more general manipulations. We consider one such case that 
will be of some interest later. First, we consider an expression for the transform 
of a product of two sequences. Suppose we have 


BE = f, 0) f, (0), 


and f, and f, are such that the transform of the product exists. An expression for 
F,(z) in terms of F,(z) and F,(z) can be developed as follows. By definition 


RO= J Aoz" 


k=—00 


From the inversion integral, Eq. (4.122), we can replace f,(k) by an integral 


co "E d 
F,(z) = Y». 50754 FG e. 


kz—oo [a 


We assume that we can find a region where we can exchange the summation with 
the integration. The contour will be called C, in this case 


oo zN—* dt 
Re) =z BO Y. AE) (3) 4. 


k=—0o 


The sum can now be recognized as F,(z/¢) and, when we substitute this, 


z\ d 
F3(z) = - Fy(C)Fi | =) +, 
3(z) ar A 2(¢) (2) C (4.129) 
the contour C, must be in the overlap of the convergence regions of F;(Z) and 
F,(z/¢). Then F,(z) will converge for the range of values of z for which C, can 

be found. 
If we let f, — f, and z — l in Eq. (4.129), we have the discrete version of 


Parseval's theorem, where convergence is on the unit circle 


ez 1 1\ dt 
F.(1) = $5 5(z)S. 4.130 
i0) pe x ORE (4.130) 


This particular theorem shows how we can compute the sum of squares of 
a time sequence by evaluating a complex integral in the z-domain. The result is 
useful in the design of systems by least squares. 
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4.6.3 


*Another Derivation of the Transfer Function 


Let D be a discrete system which maps an input sequence, (e(&)), into an output 
sequence {u(k)}.23 Then, expressing this as an operator on e(k), we have 


u(k) = D{e(k)}. 
If D is linear, then 
D{ae,(k) + Be,(k)} = aD{e, (k)} + BD{e,(k)}. (4.131) 


If the system is time invariant, a shift in e(k) to e(k + j) must result in no other 
effects but a shift in the response, u. We write 


Die(k + j)) 2 u(k + j) for all j (4.132) 


D{e(k)} = uk). 


Theorem 


If D is linear and time invariant and is given an input z* for a value of z for which 
the output is finite at time k, then the output will be of the form H (z)z*. 
In general, if e(k) = z*, then an arbitrary finite response can be written 


u(k) = A(z, k)z*. 


Consider e,(k) — z**! = ziz* for some fixed j. From Eq. (4.131), if we let 
&œ = z’, it must follow that 


u, = z/u(k) 

= zÍ H (z, k)z* 

EHET (4.133) 
From Eq. (4.132), we must have 

u, (k) = u(k + j) 
= H(z, j +k)" forall j. (4.134) 
From a comparison of Eqs. (4.133) and (4.134), it follows that 
H(z, k) = H(z, k + j) forall j 


that is, H does not depend on the second argument and can be written H (2). 
Thus for the elemental signal e(k) — z^, we have a solution u(k) of the same 
(exponential) shape but modulated by a ratio H(z), u(k) = H Ga 


23 This derivation was suggested by L. A. Zadeh in 1952 at Columbia University. 
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Can we represent a general signal as a linear sum (integral) of such elements? 
We can, by the inverse integral derived above, as follows 


e(k) = — feor, — (4.135) 
where 
Ezy- Y seh  relem (4.136) 


for signals with r « R for which Eq. (4.136) converges. We call E(z) the 
z-transform of e(k), and the (closed) path of integration is in the annular re- 
gion of convergence of Eq. (4.136). If e(k) = 0, k < 0, then R --> co, and this 
region is the whole z-plane outside a circle of finite radius. 

The consequences of linearity are that the response to a sum of signals is the 
sum of the responses as given in Eq. (4.131). Although Eq. (4.135) is the limit of 
a sum, the result still holds, and we can write 


] d 
u(k) = — F Eolresponse to zl, 
22) Z 
but, by the theorem, the response to z* is H(z)z*. Therefore we can write 


d 
ese” Peo 
2zj Z 


l ,az 
=—— PA(Z)E(z)z —. (4.137) 
2n j z 
We can define U (z) = H(z) E(z) by comparison with Eq. (4.135) and note that 
U(z) = Y^ u(k)e* = H(z) E(2). (4.138) 
k-—oo 


Thus H(z) is the transfer function, which is the ratio of the transforms of e(k) 
and u(k) as well as the amplitude response to inputs of the form z*. 

This derivation begins with linearity and stationarity and derives the z- 
transform as the natural tool of analysis from the fact that input signals in the form 
z* produce an output that has the same shape.” It is somewhat more satisfying 
to derive the necessary transform than to start with the transform and see what 
systems it is good for. Better to start with the problem and find a tool than start 
with a tool and look for a problem. Unfortunately, the direct approach requires 
extensive use of the inversion integral and more sophisticated analysis to develop 
the main result, which is Eq. (4.138). Chacun à son goût. 


24 Because z* is unchanged in shape by passage through the linear constant system, we say that z* isan 
eigenfunction of such systems. 
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4.7 Summary 


The z-transform can be used to solve discrete difference equations in the 
same way that the Laplace transform is used to solve continuous differential 
equations. 
The key property of the z-transform that allows solution of difference equa- 
tions is 

A Da FG). (4.113) 
A system will be stablein the sense that a Bounded Input will yield a Bounded 
Output (BIBO stability) if 


bI (4.35) 
l=- 


A discrete system can be defined by its transfer function (in z) or its state- 
space difference equation. 


The z-transform of the samples of a continuous system G(s) preceded by a 
zero-order-hold (ZOH) is 
G(s 
G(z)= (l -a»z |22] (4.41) 

which is typically evaluated using MATLAB's C2d.m. 
For the continuous state-space model 

x = Fx+ Gu, (4.45) 

y Hx + Ju, (4.46) 


preceded by a zero-order-hold, the discrete state-space difference equations 
are 


x(k + 1) = x(k) + Tuck), 


y(k) = Hx(k) + Ju(k), (4.59) 
where 
o = eE” 
T 
r= f e™"dnG, (4.58) 
0 


which can be evaluated by MATLAB's c2d.m. 
The discrete transfer function in terms of the state-space matrices is 


Lo 
UG) = H[zIl — 0] T, (4.64) 


which can be evaluated in MATLAB by the tf function. 
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* The characteristic behavior associated with poles in the z-plane is shown in 
Figs. 4.21 through 4.23 and summarized in Fig. 4.25. Responses are typically 
determined via MATLAB's impulse.m or step.m. 


* A system represented by H (z) has a discrete frequency response to sinusoids 
at w, given by an amplitude, A, and phase, yw, as 


A = |H(e/%")| and y = Z(H (es) (4.110) 


which can be evaluated by MATLAB's bode.m. 


e The discrete Final Value Theorem, for an F(z) that converges and has a final 
value, is given by 


jim Si) = lim(z 10952): (4.115) 


4.8 Problems 


4.1 Check the following for stability: 
(a) u(k) = 0.5u(k — 1) — 0.3u(k — 2) 
(b) u(k) = 1.6u(k —1) — u(k — 2) 
(c) u(k) =0.8u(k — 1) + 0.4u(k — 2) 


4.2 (a) Derive the difference equation corresponding to the approximation of integration 
found by fitting a parabola to the points e, ,, e, ,, e, and taking the area under this 
parabola between ¢ = kT — T and t = kT as the approximation to the integral of 
e(t) over this range. 


(b) Find the transfer function of the resulting discrete system and plot the poles and 
zeros in the z-plane. 


4.3 Verify that the transfer function of the system of Fig. 4.8(c) is given by the same H(z) 
as the system of Fig. 4.9(c). 


4.4 (a) Compute and plot the unit-pulse response of the system derived in Problem 4.2. 
(b) ls this system BIBO stable? 
4.5 Consider the difference equation 
u(k + 2) = 0.25u(X). 


(a) Assume a solution u(k) — A,z* and find the characteristic equation in z. 


(b) Find the characteristic roots z, and z, and decide if the equation solutions are stable 
or unstable. 


(c) Assume a general solution of the form 
u(k) = A,2z,* + A,z," 
and find A, and A, to match the initial conditions «(0) = 0, u(1) = 1. 
(d) Repeat parts (a), (b), and (c) for the equation 
u(k 4-2) = —0.25u(k). 
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4.6 


4.7 


4.8 


4.9 


(e) Repeat parts (a), (b), and (c) for the equation 
u(k +2) = u(k + 1) — 0.5u(k). 
Show that the characteristic equation 
z? — 2r cos(0)z +r? 

has the roots 

2,57 re^), 
(a) Use the method of block-diagram reduction, applying Figs. 4.5, 4.6, and 4.7 to 

compute the transfer function of Fig. 4.8(c). 

(b) Repeat part (a) for the diagram of Fig. 4.9(c). 
Use MATLAB to determine how many roots of the following are outside the unit circle. 
(a) 27+0.25=0 
(b) z - 1.127 +0.01z + 0.405 = 0 
(c) z’ —3.627+42-1.6=0 
Compute by hand and table look-up the discrete transfer function if the G(s) in Fig. 4.12 is 


(a) & 


3 
(b) s(s+3) 

3 
(O croces 
(d) cyn 
TR 
(e) Sp 


5 


( 42 


3e 1575 
6 cina» 
(h) Repeatthe calculation of these discrete transfer functions using MATLAB. Compute 
for the sampling period T = 0.05 and T = 0.5 and plot the location of the poles 


and zeros in the z-plane. 


4.10 Use MATLAB to compute the discrete transfer function if the G(s) in Fig. 4.12 is 


(a) the two-mass system with the non-colocated actuator and sensor of Eq. (A.21) with 
sampling periods T = 0.02 and T = 0.1. Plot the zeros and poles of the results in 
the z-plane. Let w, = 5,4, = 0.01. 

(b) the two-mass system with the colocated actuator and sensor given by Eq. (A.23). 
Use T = 0.02 and T = 0.1. Plot the zeros and poles of the results in the z-plane. 
Let w, =3,0,=3,9,=%,=0. 

(c) the two-input-two-output paper machine described in Eq. (A.24). Let T = 0.1 and 
T — 0.5. 


4.11 Consider the system described by the transfer function 


Y(s)) | " 
Us) | G-DG-3)y 
(a) Draw the block diagram corresponding to this system in control canonical form, 
define the state vector, and give the corresponding description matrices F, G, H, J. 


G(s) 


(b) 


(c) 


(d) 


(a) 
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Write G(s) in partial fractions and draw the corresponding parallel block diagram 
with each component part in control canonical form. Define the state € and give the 
corresponding state description matrices A, B, C, D. 


By finding the transfer functions X,/U and X,/U of part (a) in partial fraction 
form, express x, and x, in terms of &, and &, . Write these relations as the 
two-by-two transformation T such that x — T£. 


Verify that the matrices you have found are related by the formulas 


A= T^ FT, 
B-T'G, 
C- HT, 
D - J. 
4.12 The first-order system (z — o)/(1 — o)z has a zero at z = a. 
Plot the step response for this system for œ = 0.8, 0.9, 1.1, 1.2, 2. 
Plot the overshoot of this system on the same coordinates as those appearing in 


(b) 


(c) 


Fig. 4.30for 1 < a < 1. 


In what way is the step response of this system unusual for œ > 1? 


4.13 The one-sided z-transform is defined as 


(a) 


(b) 


(c) 
(d) 
(e) 


(f) 


Fl) = 9 fe". 
0 


Show that the one-sided transform of f(k + 1) is 
Z{f(k - 1) = 2F(z) — zf (0). 


Use the one-sided transform to solve for the transforms of the Fibonacci numbers 
by writing Eq. (4.4) as u,,, = u,,, + Up Let uy =u, = 1. [You will need to 
compute the transform of f (k + 2).] 

Compute the location of the poles of the transform of the Fibonacci numbers. 
Compute the inverse transform of the numbers. 


Show that if u, is the kth Fibonacci number, then the ratio u,,,/u, will go to 
(14 J/5)/2, the golden ratio of the Greeks. 


Show that if we add a forcing term, e(k), to Eq. (4.4) we can generate the Fibonacci 
numbers by a system that can be analyzed by the two-sided transform; i.e., let 
U, = U, tugz +e, and let e, = 54(k) (8,(k) = | at k = 0 and zero elsewhere). 
Take the two-sided transform and show the same U(z) results as in part (b). 


4.14 Substitute u = Az‘ and e = Bz into Eqs. (4.2) and (4.7) and show that the transfer 
functions, Eqs. (4.15) and (4.14), can be found in this way. 


4.15 Consider the transfer function 


(z + 1)(z? — 1.3z + 0.81) 
H(z) =. 5 — >. 
(z? — 1.2z + 0.5)(z? — 1.4z + 0.81) 


Draw a cascade realization, using observer canonical forms for second-order blocks and 
in such a way that the coefficients as shown in H(z) above are the parameters of the 
block diagram. 
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4.16 (a) Write the H(z) of Problem 4.15 in partial fractions in two terms of second order 
each, and draw a parallel realization, using the observer canonical form for 
each block and showing the coefficients of the partial-fraction expansion as the 
parameters of the realization. 

(b) Suppose the two factors in the denominator of H(z) were identical (say we change 
the 1.4 to 1.2 and the 0.81 to 0.5). What would the parallel realization be in this 
case? 

4.17 Show that the observer canonical form of the system equations shown in Fig. 4.9 can be 

written in the state-space form as given by Eq. (4.27). 

4.18 Draw out each block of Fig. 4.10 in (a) control and (b) observer canonical form. Write 

out the state-description matrices in each case. 

4.19 For a second-order system with damping ratio 0.5 and poles at an angle in the z-plane of 

9 = 30°, what percent overshoot to a step would you expect if the system had a zero at 

z, = 0.6? 

4.20 Consider a signal with the transform (which converges for |z| > 2) 

z 
U (z) = ————. 
el (z — 1)(z — 2) 
(a) What value is given by the formula (Final Value Theorem) of (4.115) applied to 
this U(z)? 

(b) Find the final value of u(k) by taking the inverse transform of U(z), using 

partial-fraction expansion and the tables. 

(c) Explain why the two results of (a) and (b) differ. 


4.21 (a) Find the z-transform and be sure to give the region of convergence for the signal 
u(k) 2 r8, [p< t. 


[Hint: Write u as the sum of two functions, one for k > 0 and one for k < 0, 
find the individual transforms, and determine values of z for which both terms 
converge.] 


(b) Ifa rational function U(z) is known to converge on the unit circle |z| = 1, show 
how partial-fraction expansion can be used to compute the inverse transform. Apply 
your result to the transform you found in part (a). 


4.22 Compute the inverse transform, f (k), for each of the following transforms: 


(a) F(z) = wher, Iz) > 1; 
(b) F(z) em dab 
() Fe) =i duel 
(d) F(z2- ep 1/2 < |z| < 2. 
4.23 Ms Ee to plot the time sequence associated with each of the transforms in Prob- 


4.24 Use the z-transform to solve the difference equation 


y(k) — 3y(k — 1) -2y(k — 2) = 2u(k — 1) —2u(k — 2), 
k, k20 
BIW Gy. S 
y(k) = 0, k « Q. 
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4.25 For the difference equation in Problem 4.24, solve using MATLAB. 
4.26 Compute by hand and table look-up the discrete transfer function if the G(s) in Fig. 4.12 is 
obj- dE. 
s“(s + 10) 
and the sample period is 7 = 10 msec. Verify the calculation using MATLAB. 
4.27 Find the discrete state-space model for the system in Problem 4.26. 
4.28 Compute by hand and table look-up the discrete transfer function if the G(s) in Fig. 4.12 is 
10(s + 1) 
sti+s+10 
and the sample period is 7 = 10 msec. Verify the calculation using MATLAB and find the 
DC gain of both the G(s) and the G(z). 


4.29 Find the discrete state-space model for the system in Problem 4.28. Then compute the 
eigenvalues of d» and the transmission zeros of the state-space model. 


4.30 Find the state-space model for Fig. 4.12 with 


G(s) = 


l 
G(s) = S 


where there is a one cycle delay after the A/D converter. 


Sampled-Data Systems 


A Perspective on Sampled-Data Systems 


The use of digital logic or digital computers to calculate a control action for 
a continuous, dynamic system introduces the fundamental operation of sam- 
pling. Samples are taken from the continuous physical signals such as po- 
sition, velocity, or temperature and these samples are used in the computer 
to calculate the controls to be applied. Systems where discrete signals ap- 
pear in some places and continuous signals occur in other parts are called 
sampled-data systems because continuous data are sampled before being used. 
In many ways the analysis of a purely continuous system or of a purely dis- 
crete system is simpler than that of sampled-data systems. The analysis of 
linear, time-invariant continuous systems can be done with the Laplace 
trans-form and the analysis of linear time-invariant discrete systems can be 
done with the z — transform alone. If one is willing to restrict attention to 
only the samples of all the signals in a digital control one can do much 
useful analysis and design on the system as a purely discrete system using 
the z-transform. However the physical reality is that the computer 
operations are on discrete signals while the plant signals are in the 
continuous world and in order to consider the behavior of the plant 
between sampling instants, it is necessary to consider both the discrete 
actions of the computer and the con-tinuous response of the plant. Thus the 
role of sampling and the conversion from continuous to discrete and back 
from discrete to continuous are very important to the understanding of the 
complete response of digital control, and we must study the process of 
sampling and how to make mathematical models of analog-to-digital 
conversion and digital-to-analog conversion. This analysis requires careful 
treatment using the Fourier transform but the effort is well rewarded with the 
understanding it provides of sampled-data systems. 
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5.1 


Chapter Overview 


In this chapter, we introduce the analysis of the sampling process and describe 
both a time-domain and a frequency-domain representation. We also describe the 
companion process of data extrapolation or data holding to construct a continuous 
time signal from samples. As part of this analysis we show that a sampled-data 
system is made time varying by the introduction of sampling, and thus it is 
not possible to describe such systems exactly by a continuous-time transfer 
function. However, a continuous signal is recovered by the hold process and 
we can approximate the sinusoidal response of a sampler and hold by fitting 
another sinusoid of the same frequency to the complete response. We show how 
to compute this best-fit sinusoidal response analytically and use it to obtain a 
good approximation to a transfer function. For those familiar with the idea, this 
approach is equivalent to the use of the “describing function” that is used to 
approximate a transfer function for simple nonlinear systems. In Section 5.1 the 
analysis of the sample and hold operation is considered and in Section 5.2 the 
frequency analysis of a sampled signal is given. Here the important phenomenon 
of signal aliasing caused by sampling is introduced. In Section 5.3 the zero-order 
hold and some of its generalizations are considered. Analysis of sampled-data 
systems in the frequency domain is introduced in Section 5.4 including block 
diagram analysis of these combined systems. Finally in Section 5.5 computation 
of intersample ripple is discussed. 


Analysis of the Sample and Hold 


To get samples of a physical signal such as a position or a velocity into digital 
form, we typically have a sensor that produces a voltage proportional to the 
physical variable and an analog-to-digital converter, commonly called an A/D 
converter or ADC, that transforms the voltage into a digital number. The physical 
conversion always takes a non-zero time, and in many instances this time is 
significant with respect to the sample period of the control or with respect to 
the rate of change of the signal being sampled. In order to give the computer 
an accurate representation of the signal exactly at the sampling instants KT, the 
A/D converter is typically preceded by a sample-and-hold circuit (SHC). A 
simple electronic schematic is sketched in Fig. 5.1, where the switch, S, is an 
electronic device driven by simple logic from a clock. Its operation is described 
in the following paragraph. 

With the switch, S, in position 1, the amplifier output v „(ź) tracks the input 
voltage v,,(¢) through the transfer function 1/(RCs + 1). The circuit bandwidth 
of the SHC, 1/RC, is selected to be high compared to the input signal bandwidth. 
Typical values are R = 1000 ohms, C = 30 x 10^" farads for a bandwidth of 
f = 1/27 RC = 5.3 MHz. During this “tracking time,” the ADC is turned off 
and ignores V „w: When a sample is to be taken at t = KT the switch S is set 


Figure 5.1 
Analog-to-digital 
converter with sample 
and hold 
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H = Hold; S in position 2. 
T = Track; S in position I. 


to position 2 and the capacitor C holds the output of the operational amplifier 
frozen from that time at v,, (k T) = v, (kT). The ADC is now signaled to begin 
conversion of the constant input from the SHC into a digital number which 
will be a true representation of the input voltage at the sample instant. When 
the conversion is completed, the digital number is presented to the computer 
at which time the calculations based on this sample value can begin. The SHC 
switch is now moved to position 1, and the circuit is again tracking, waiting for 
the next command to freeze a sample. The SHC needs only to hold the voltage 
for a short time on the order of microseconds in order for the conversion to be 
completed before it starts tracking again. The value converted is held inside the 
computer for the entire sampling period of the system, so the combination of 
the electronic SHC plus the ADC operate as a sample-and-hold for the sampling 
period, 7, which may be many milliseconds long. The number obtained by the 
ADC is aquantized version of the signal represented in a finite number of bits, 12 
being a typical number. As a result, the device is nonlinear. However, the signals 
are typically large with respect to the smallest quantum and the effect of this 
nonlinearity can be ignored in a first analysis. A detailed study of quantization is 
included in Chapter 10. 

For the purpose of the analysis, we separate the sample and hold into two 
mathematical operations: a sampling operation represented by impulse modula- 
tion and a hold operation represented as a linear filter. The symbol or schematic of 
the ideal sampler is shown in Fig. 5.2; its role is to give a mathematical represen- 
tation of the process of taking periodic samples from r(t) to produce r(k T) and 
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Figure 5.2 
The sampler 
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oo 
r(t) r'()* »ru)6( — kt) 
lt NU DIO $ P 


T 


to do this in such a way that we can include the sampled signals in the analysis of 
continuous signals using the Laplace transform.' The technique is to use impulse 
modulation as the mathematical representation of sampling. Thus, from Fig. 5.2, 
we picture the output of the sampler as a string of impulses 
oo 
r'()-2 Y rna - kT). (5.1) 
kz—oo 

The impulse can be visualized as the limit of a pulse of unit area that has growing 
amplitude and shrinking duration. The essential property of the impulse is the 
sifting property that 


f f(t)d(t — a)dt = f(a) (5.2) 


for all functions f that are continuous at a. The integral of the impulse is the unit 
step 


i ó(r)dt = I(t), (5.3) 
—00 
and the Laplace transform of the unit impulse is 1, because 
oo 
L{d(t)} =| d(t)e “dt =1. (5.4) 


Using these properties we can see that r*(t), defined in Eq. (5.1 ), depends only 
on the discrete sample values r(kT). The Laplace transform of r*(t) can be 
computed as follows 


L{r*(t)} aj r*(t)e™dr. 


oo 


If we substitute Eq. (5.1) for r' (t), we obtain 


=f wi r(t)d(t — kT)e "ar, 


co (2—oc 
and now, exchanging integration and summation and using Eq. (5.2), we have 


oo 


R*(s) = x” r(kT)e-**7, (5.5) 


——00 


] We assume that the reader has some familiarity with Fourier and Laplace transform analysis. A general 
reference is Bracewell (1978). 


Figure 5.3 
The sample and hold, 


showing typical signals. 


(a) Input signal r; 
(b) sampled signal r*; 
(c) output signal r,; 


(d) sample and hold 
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The notation R*(s) is used to symbolize the (Laplace) transform of r*(t), the 
sampled or impulse-modulated r (t).? Notice that if the signal r(t) in Eq. (5.1) is 
shifted a small amount then different samples will be selected by the sampling 
process for the output proving that sampling is not a time-invariant process. 
Consequently one must be very careful in using transform analysis in this context. 

Having a model of the sampling operation as impulse modulation, we need 
to model the hold operation to complete the description of the physical sample- 
and-hold which will take the impulses that are produced by the mathematical 
sampler and produce the piecewise constant output of the device. Typical signals 
are sketched in Fig. 5.3. Once the samples are taken, as represented by r*(t) in 


(c) (d) 


2 It will be necessary, from time to time, to consider sampling a signal that is not continuous. The only 
case we will consider will be equivalent to applying a step function, 1 (t), to a sampler. For the purposes 
of this book we will define the unit step to be continuous from the right and assume that the impulse, 
(t), picks up the full value of unity. By this convention and Eq. (5.1) we compute 


1*0) =A 1o —kT), (a) 
k=0 


and, using Eq. (5.2), we obtain 
LUD} = 1/0 —e7"). (b) 


The reader should be warned that the Fourier integral converges to the average value of a function 
at a discontinuity and not the value approached from the right as we assume. Because our use of 
the transform theory is elementary and the convenience of equation (b) above is substantial, we have 
selected the continuous-from-the-right convention. In case of doubt, the discontinuous term should be 
separated and treated by special analysis, perhaps in the time domain. 
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5.2 


Eq. (5.1), the hold is defined as the means whereby these impulses are extrapolated 
to the piecewise constant signal r, (t), defined as 


r,(t) — r(kT) kT €t «kT «t T. (5.6) 


A general technique of data extrapolation from samples is to use a polynomial 
fit to the past samples. If the extrapolation is done by a constant, which is a 
zero-order polynomial, then the extrapolator is called a zero-order hold, and its 
transfer function is designated as ZO H (s). We can compute ZO H (s) as the 
transform of its impulse response.” If r"(t) = ó(t), then r,(t), which is now the 
impulse response of the Z OH, is a pulse of height 1 and duration T seconds. 
The mathematical representation of the impulse response is simply 


p(t) =1(t)-1@—-T). 
The required transfer function is the Laplace transform of p(t) as 
ZOH(s) = L{p(t)} 
= [ue — l(t - T)je "dt 
- ens. (5.7) 


Thus the linear behavior of an A/D converter with sample and hold can be modeled 
by Fig. 5.3. We must emphasize that the impulsive signal r*(t) in Fig. 5.3 is not 
expected to represent a physical signal in the A/D converter circuit; rather it is 
a hypothetical signal introduced to allow us to obtain a transfer-function model 
of the hold operation and to give an input-output model of the sample-and-hold 
suitable for transform and other linear systems analysis. 


Spectrum of a Sampled Signal 


We can get further insight into the process of sampling by an alternative repre- 
sentation of the transform of r*(t) using Fourier analysis. From Eq. (5.1) we see 
that r*(t) is a product of r(t) and the train of impulses, 26(t — kT). The latter 
series, being periodic, can be represented by a Fourier series 

co eo 
308-8) = De Orel, 


where the Fourier coefficients, C, are given by the integral over one period as 


1 TA co 
€. = | VA ó(t - kT)e-?"ezt/Da,. 


T/2 k-—oo 


3 The hold filter in Fig. 5.3(d) will receive one unit-size impulse if the input signal is zero at every sample 
time except f = 0 and is equal to 1 there. That is, if r(A7) = 0, k =O and r(0) = I. 
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The only term in the sum of impulses that is in the range of the integral is 
the 5(r) at the origin, so the integral reduces to 


1 T/2 
OR = “ah 8(r)e ?" 9x" Dg. 
T J-rj 


but the sifting property from Eq. (5.2) makes this easy to integrate, with the result 


Thus we have derived the representation for the sum of impulses as a Fourier 
series 
"t PENNE Qxn/ T)? 
2o at kT) = 3 px e (5.8) 
We define w, = 27 /T as the sampling frequency (in radians per second) and 
now substitute Eq. (5.8) into Eq. (5.1) using w,. We take the Laplace transform 
of the output of the mathematical sampler, 


£p f r(t) B » enjeu 


n=- 


and integrate the sum, term by term to obtain 


3 l co eo ; 
R*(s) = 7 ne f r(r)e/" edt. 


n=-o0o0"7 


If we combine the exponentials in the integral, we get 


» a d ; 
R (s) — T >, i r(t)e 69g, 


n-Éz-—oo* ~ 


The integral here is the Laplace transform of r(t) with only a change of variable 
where the frequency goes. The result can therefore be written as 
* l z = 

R*(s) = 7 R(s — jno,), (5.9) 
where R(s) is the transform of r(r). In communication or radio engineering 
terms, Eq. (5.8) expresses the fact that the impulse train corresponds to an infinite 
sequence of carrier frequencies at integral values of 27/7, and Eq. (5.9) shows 
that when r(t) modulates all these carriers, it produces a never-ending train of 
sidebands. A sketch of the elements in the sum given in Eq. (5.9) is shown in 
Fig. 5.4. 

An important feature of sampling, shown in Fig. 5.4, is illustrated at the 
frequency marked w,. Two curves are drawn representing two of the elements 
that enter into the sum given in Eq. (5.9). The value of the larger amplitude 
component located at the frequency w, is the value of R(jw,). The smaller 
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Figure 5.4 

(a) Sketch of a spectrum 
amplitude and (b) the 
components of the 
spectrum after sampling, 
showing aliasing 


aliasing 


component shown at w, comes from the spectrum centered at 27/7 and is 
R(jax), where w is such that w = w, — 2z/T. This signal at frequency «y 
which produces a component at frequency w, after sampling is called in the trade 
an "alias" of w,; the phenomenon is called aliasing. 

The phenomenon of aliasing has a clear meaning in time. Two continuous 
sinusoids of different frequencies appear at the same frequency when sampled. 
We cannot, therefore, distinguish between them based on their samples alone. 
Figure 5.5 shows a plot of a sinusoid at i Hz and of a sinusoid at z Hz. If we 
sample these waves at 1 Hz, as indicated by the dots, then we get the same sample 
values from both signals and would continue to get the same sample values for 
all time. Note that the sampling frequency is 1, and, if f, = 1, then 


The significance of the negative frequency is that the 2-Hz sinusoid in Fig. 5.5 is 
a negative sine function. 

Thus, as a direct result of the sampling operation, when data are sampled at 
frequency 2x / T, the total harmonic content at a given frequency c, is to be found 
not only from the original signal at w, but also from all those frequencies that are 
aliases of w, namely, components from all frequencies w, + n2z/ T = w, + no, 
as shown in the formula of Eq. (5.9) and sketched in Fig. 5.4. The errors caused by 
aliasing can be very severe if a substantial quantity of high-frequency components 
is contained in the signal to be sampled. To minimize the error caused by this 


Figure 5.5 

Plot of two sinusoids 
that have identical 
values at unit sampling 
intervals—an example of 
aliasing 


sampling theorem 


hidden oscillations 
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effect, it is standard practice to precede the sampling operation (such as the 
sample-and-hold circuit discussed earlier) by a low-pass antialias filter that will 
remove substantially all spectral content above the half-sampling frequency, i.e., 
above 2z/T. A sketch suggesting the result of an anti-aliasing filter is drawn in 
Fig. 5.6. 

If all spectral content above the frequency 7/T is removed, then no aliasing 
is introduced by sampling and the signal spectrum is not distorted, even though it 
is repeated endlessly, centered at n22/T. The critical frequency, 2/7, was first 
reported by H. Nyquist and is called the Nyquist frequency. Band-limited signals 
that have no components above the Nyquist frequency are represented unambigu- 
ously by their samples. A corollary to the aliasing issue is the sampling theorem. 
We have seen that if R(jw) has components above the Nyquist frequency c, /2 or 
7. / T , then overlap and aliasing will occur. Conversely, we noticed that if R( jc) is 
zero for |w| > zx/T, then sampling at intervals of T sec. will produce no aliasing 
and the original spectrum can be recovered exactly from R*, the spectrum of the 
samples. Once the spectrum is recovered by inverse transform, we can calculate 
the original signal itself. This is the sampling theorem: One can recover a signal 
from its samples if the sampling frequency (w, = 27 / T) is at least twice the 
highest frequency (7/7) in the signal. Notice that the sampling theorem requires 
that R(jw) is exactly zero for all frequencies above x/T. 

A phenomenon somewhat related to aliasing is that of hidden oscillations. 
There is the possibility that a signal could contain some frequencies that the 
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Figure 5.6 


(a) Sketch of a spectrum 
amplitude and (b) the 


components of the 


spectrum after sampling, 


showing removal of 
aliasing with an 
antialiasing filter 
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samples do not show at all. Such signals, when they occur in a digital control 
system, are called “hidden oscillations,’ an example of which is shown in a 
design problem in Fig. 7.29. Hidden oscillations can only occur at multiples of 
the Nyquist frequency (z/T). 


Data Extrapolation 


The sampling theorem states that under the right conditions it is possible to 
recover a signal from its samples; we now consider a formula for doing so. 
From Fig. 5.6 we can see that the spectrum of R(jo) is contained in the low- 
frequency part of R*(jw). Therefore, to recover R(jo) we need only process 
R*(jo) through a low-pass filter and multiply by T. As a matter of fact, if R(jw) 
has zero energy for frequencies in the bands above z/T (such an R is said to be 
band-limited), then an ideal low-pass filter with gain T for —x/T <w < n/T 
and zero elsewhere would recover R(jw) from R*(jw) exactly. Suppose we 
define this ideal low-pass filter characteristic as L(jw). Then we have the result 


R(jo) = L(jw)R* jo). (5.10) 


The signal r(t) is the inverse transform of R(jw), and because by Eq. (5.10) 
R(jo) is the product of two transforms, its inverse transform r(t) must be the 
convolution of the time functions £(ft) and r* (1r). The form of the filter impulse 
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response can be computed by using the definition of L(jw) from which the 
inverse transform gives 


] x/T 
£(t) = =l Te’ do 
PIEJ aiT 
E T ei”! IT 
Qe jt 'T 
d JT) _ oiT) 
^ Onjt M s ) 
. Sin(rt/T) 
ie Fue 
A sinc Z (5.11) 
= sinc —. s 
F 
Using Eq. (5.1) for r*(t) and Eq. (5.11) for £(t), we find that their convolution is 
oo oo 
t -— 
r(t) zi r(t) A. ó(t erian c Des. 
s oa T 
Using the sifting property of the impulse, we have 
B . n(t —kT) 
"(t) — (KT = .12 
r(t) Sol )sinc T (5.12) 


kz—oo 
Equation (5.12) is a constructive statement of the sampling theorem. It shows 
explicitly how to construct a band-limited function r(t) from its samples. The 
sinc functions are the interpolators that fill in the time gaps between samples with 
a signal that has no frequencies above zx /7T. A plot of the impulse response of 
this "ideal" hold filter is drawn in Fig. 5.7 from the formula of Eq. (5.11). 

There is one serious drawback to the extrapolating signal given by Eq. (5.11). 
Because £(r) is the impulse response of the ideal low-pass filter L(j c), it follows 
that this filter is noncausal because £(t) is nonzero for t < 0. £(t) starts at 
t = —oo while the impulse that triggers it does not occur until £ = 0! In many 
communications problems the interpolated signal is not needed until well after the 
samples are acquired, and the noncausality can be overcome by adding a phase 
lag, e /*^, to L(jw), which adds a delay to the filter and to the signals processed 
through it. In feedback control systems, a large delay is usually disastrous for 
stability, so we avoid such approximations to this function and use something 
else, like the polynomial holds, of which the zero-order hold already mentioned 
in connection with the ADC is the most elementary and the most common. 

In Section 5.2 we introduced the zero-order hold as a model for the storage 
register in an A/D converter that maintains a constant signal value between 
samples. We showed in Eq. (5.7) that it has the transfer function 

= eet 


ZOH(jw) = ————. (5.13) 
jw 
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Figure 5.7 

Plot of the impulse 
response of the ideal 
low-pass filter 


We can discover the frequency properties of ZOH(jw) by expressing Eq. 
(5.13) in magnitude and phase form. To do this, we factor out e /^"7 and multiply 
and divide by 2j to write the transfer function in the form 


| eletz o7 jern | 2j 


ZOH (jw) = e*t? - 
2j 


jo 


The term in brackets is recognized as the sine, so this can be written 


ZOH(jo) = Ze-ieth sin(wT /2) 
oT /2 
and, using the definition of the sinc function, 
ZOH (jw) = e """?Tsinc(oT/2). (5.14) 


Thus the effect of the zero-order hold is to introduce a phase shift of wT /2, which 
corresponds to a time delay of T'/2 seconds, and to multiply the gain by a function 
with the magnitude of sinc(w7/2). A plot of the magnitude is shown in Fig. 5.8, 
which illustrates the fact that although the zero-order hold is a low-pass filter, 
it has a cut-off frequency well beyond the Nyquist frequency. The magnitude 
function is 

IZOH(jw)| =T : (5.15) 


. oT 
sinc —— 
2 


Figure 5.8 

(a) Magnitude and (b) 
phase of polynomial 
hold filters 
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which slowly gets smaller as gets larger until it is zero for the first time at 
w =w, = 2z/T. The phase is 


ZZOH(jo) = =, (5.16) 


plus the 180° shifts where the sinc function changes sign. 

We can now give a complete analysis of the sample-and-hold circuit of 
Fig. 5.3(d) for a sinusoidal input r(t) in both the time and the frequency domains. 
We consider first the time domain, which is simpler, being just an exercise in 
construction. For purposes of illustration, we will use r(?) = 3 sin(50t + 2/6) 
as plotted in Fig. 5.9. If we sample r(?) at the instants kT where the sampling 
frequency is w, = 2x/T = 200x and T = 0.01, then the plot of the resulting 
r, (kT) is as shown in Fig. 5.9. Notice that although the input is a single sinusoid, 
the output is clearly nor sinusoidal. Thus it is not possible to describe this system 
by a transfer function, because the fundamental property of linear, time-invariant 
systems is that a sinusoid input produces an output that is a sinusoid of the 
same frequency and the relative amplitudes and phases determine the transfer 
function. The sample-and-hold system is linear but time varying. In the frequency 
domain, it is clear that the output r, (7) contains more than one frequency, and a 
complete analysis requires that we compute the amplitudes and phases of them 
all. However, in the application to control systems, the output of the hold will 
typically be applied to a dynamical system that is of low-pass character; thus 
the most important component in r, (f) is the fundamental harmonic, at w, = 50 
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Figure 5.9 

Plot of 3 sin(SOt + 2/6) 
and the output of a 
sample-and-hold with 
sample period T = 0.01 


rad/sec in this case. The other harmonics are impostors, appearing as part of the 
output signal when they are really unwanted consequences of the sample-and- 
hold process. In any event, we can proceed to analyze r,(t) for all its harmonics 
and select out the fundamental component, either by analysis or by a low-pass 
smoothing filter. 

First, we need the spectrum of r(t). Because a sinusoid can be decomposed 
into two exponentials, the spectrum of r(t) = A cos(w,t + $) is given by two 
impulse functions at w, and —w, of intensity 7 A and phase $ and —@ as* 


R(jo) = wAle’*5(w — w,) +e lw + c,)]. 


A sketch of this spectrum is shown in Fig 5.10(a) for A = 1/m. We represent 
the impulses by arrows whose heights are proportional to the intensities of the 
impulses. 

After sampling, as we saw in Eq. (5.9), the spectrum of R* is directly derived 
from that of R as the sum of multiple copies of that of R shifted by n2z/T for all 
integers n and multiplied by 1/ T. A plot of the result normalized by T is shown 
in Fig. 5.10(b). Finally, to find the spectrum of R,, we need only multiply the 
spectrum of R* by the transfer function Z OH (jw), which is 


ZOH(jo) = Te """?sinc(oT/2). 


4 See the appendix to this chapter for details. 


Figure 5.10 
Plot of the spectra of 
(a) R; (b) f; and (c) R, 
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Thus the spectrum of R, is also a sum of an infinite number of terms, but now 
with intensities modified by the sinc function and phases shifted by the delay 
function wT /2. These intensities are plotted in Fig. 5.10(c). Naturally, when all 
the harmonics included in R, are converted to their time functions and added, 
they sum to the piecewise-constant staircase function plotted earlier in Fig. 5.9. 

If we want a best approximation to r, using only one sinusoid, we need 
only take out the first or fundamental harmonic from the components of R*. This 
component has phase shift $ and amplitude A sinc(cT /2). In the time domain, 
the corresponding sinusoid is given by 


v(t) = A[sinc(oT/2)]sin[o,(t — 2 (5.17) 


A plot of this approximation for the signal from Fig 5.9 is given in Fig. 5.11 
along with both the original input and the sampled-and-held output to show 
the nature of the approximation. In control design, we can frequently achieve a 
satisfactory design for a sampled-data system by approximating the sample and 
hold with a continuous transfer function corresponding to the delay of 7/2. The 
controller design is then done in the continuous domain but is implemented by 
computing a discrete equivalent. More discussion of this technique, sometimes 
called emulation, will be given in Chapter 6, where some examples illustrate the 
results. 
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Figure 5.11 

Plot of the output of the 
sample and hold and the 
first harmonic 
approximation 


Xt 


Figure 5.12 
A cascade of samplers 
and filters 


"T 


Block-Diagram Analysis of Sampled-Data 
Systems 


We have thus far talked mainly about discrete, continuous, and sampled signals. 
To analyze a feedback system that contains a digital computer, we need to be able 
to compute the transforms of output signals of systems that contain sampling 
operations in various places, including feedback loops, in the block diagram. 
The technique for doing this is a simple extension of the ideas of block-diagram 
analysis of systems that are all continuous or all discrete, but one or two rules 
need to be carefully observed to assure success. First, we should review the facts 
of sampled-signal analysis. 

We represent the process of sampling a continuous signal and holding it by 
impulse modulation followed by low-pass filtering. For example, the system of 
Fig. 5.12 leads to 


E(s) = R'(s) H(s), 
U (s) = E*(s)G(s). (5.18) 


M An 4 a 


R* (s) E(s) mL Us) | U*(s) 
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Impulse modulation of continuous-time signals like e(t) and u(t) produces a 
series of sidebands as given in Eq. (5.9) and plotted in Fig. 5.4, which result in 
periodic functions of frequency. If the transform of the signal to be sampled is a 
product of a transform that is already periodic of period 27 / T, and one that is 
not, as in U(s) = E*(s)G(s), where E*(s) is periodic and G(s) is not, we can 
show that E*(s) comes out as a factor of the result. This is the most important 
relation for the block-diagram analysis of sampled-data systems, namely? 


U*(s) = (E'(s)G(s))* = E'(s)G' (s). (5.19) 


We can prove Eq. (5.19) either in the frequency domain, using Eq. (5.9), or 
in the time domain, using Eq. (5.1) and convolution. We will use Eq. (5.9) here. 
If U(s) = E*(s)G(s), then by definition we have 


U*(s) — = > E*(s — jnw,)G(s — jnw,); (5.20) 
but E*(s) is 
1 2 
E'G)- 7 E» E(s — jke,), 
so that 
E'(s — jnw,) = 7 3» E(s — jkw, — jno,). (5.21) 


Now in Eq. (5.21) we can let k = £ — n to get 


j' 4e l 
E'( — jno) = 3 X EG - jlw,) 
l=- 


In other words, because E* is already periodic, shifting it an integral number of 
periods leaves it unchanged. Substituting Eq. (5.22) into Eq. (5.20) yields 


U*(s) = EG 2; G(s — jno,) 
= E'(s)G'(s). QED (5.23) 
Note especially what is nor true. If U(s) = E(s)G(s), then U*(s) + E*(s)G" (s) 
but rather U* (s) = (EG)*(s). The periodic character of E* in Eq. (5.19) is crucial. 


5 We of course assume the existence of U” (s), which is assured if G(s) tends to zero as s tends to infinity 
at least as fast as 1/s. We must be careful to avoid impulse modulation of impulses, for 6(t)8(t) is 
undefined. 
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Figure 5.13 


Block diagram of digital control as a sampled-data system 


| Model of 
| AD 
| samplin 
| 


@ Example 5.1 


Modelof | Model of | Plant | 
computer | D/A | | 
progam | converter | | 

| | | 


The final result we require is that, given a sampled-signal transform such as 
U* (s), we can find the corresponding z-transform simply by letting e7 = z or 


U(z) = U"(s) lur, ,- (5.24) 


There is an important time-domain reflection of Eq. (5.24). The inverse Laplace 
transform of U*(s) is the sequence of impulses with intensities given by u(k T); 
the inverse z-transform of U (z) is the sequence of values u(kT). Conceptually, 
sequences of values and the corresponding z-transforms are easy to think about 
as being processed by a computer program, whereas the model of sampling as a 
sequence of impulses is what allows us to analyze a discrete system embedded 
in a continuous world (see Fig. 5.13). Of course, the impulse modulator must 
always be eventually followed by a low-pass circuit (hold circuit) in the physical 
world. Note that Eq. (5.24) can also be used in the other direction to obtain U*(s), 
the Laplace transform of the train of impulses, from a given U (2). 


Block Diagram Analysis 


Compute the transforms of Y* and Y for the system block diagram of Fig. 5.13. 


Solution. In Fig. 5.13 we have modeled the A/D converter plus computer program plus D/A 
converter as an impulse modulator {which takes the samples from e(t)],a computer program 
that processes these samples described by D*(s), and a zero-order hold that constructs the 
piecewise-constant output of the D/A converter from the impulses of m^. In the actual computer 
we assume that the samples of e(t) are manipulated by a difference equation whose input- 
output effect is described by thez -transform D(z). These operations are represented in Fig. 5.13 
as if they were performed on impulses, and hence the transfer function is D*(s) according to 
Eq. (5.24). Finally, the manipulated impulses, m*(t), are applied to the zero-order hold from 
which the piecewise-constant-control signal u(t) comes. In reality, of course, the computer 
operates on the sample values of e(t) and the piecewise-constant output is generated via a 
storage register and a D/A converter. The impulses provide us with a convenient, consistent, 
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and effective model of the processes to which Laplace-transform methods can be applied. 
From the results given thus far, we can write relations among Laplace transforms as 


E(s)=R — Y, (a) 
M* (s)  E*D*, (b) 
NS ip 
Dou [| E (5.25) 
Y=GU. (d) 


The usual idea is to relate the discrete output, Y*, to the discrete input, R*. Suppose we 
sample each of these equations by using the results of Eq. (5.19) to "star" each transform. The 
equations are © 


E*=R*-—Y*, (a) 
M*=E*D", (b) 
Ut=M", (c) 
Y'-[GUyY. (d) 


(5.26) 


Now Eq. (5.26(d)) indicates that we need U, not U*, to compute Y*, so we must back up to 
substitute Eq. (5.25(c)) into Eq. (5.26(d)): 


27$ * 
y* = [em (^ = ) (5.27) 


Taking out the periodic parts, which are those in which s appears only as e^? [which 
include M*(s)], we have 


Y* 2(1— e )w* ($). (5.28) 
Substituting from Eq. (5.26(b)) for M* gives 
y* =. — YET D (G/N (5.29) 
And substituting Eq. (5.26(a)) for E* yields 
Y* = (1 —e7"5)D*(G/s)*[R* — Y*]. (5.30) 
If we call 
(1 — e775) D*(G/s)* = H*, (5.31) 


then we can solve Eq. (5.30) for Y*, obtaining 


6 In sampling Eq. (5.25(c)) we obtain Eq. (5.26(c)) by use of the continuous-from-the-right convention 
for Eq. (5.5) for impulse modulation of discontinuous functions. From the time-domain operation of 
the zero-order hold, it is clear that the samples of « and m are the same, and then from this Eq. (5.26(c)) 
follows. 
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€ Example 5.2 


Analysis of a Specific Block Diagram 


Apply the results of Example 1 to compute Y* and Y forthe case where 


a 

Sce aU (5.33) 

and the sampling period T is such that e7^" = 
discrete integrator 


+ 


a 
1 
3. The computer program corresponds to a 


u(kT) = u(kT — T) + Kye(kT), (5.34) 


and the computer D/A holds the output constant so that the zero-order hold is the correct 
model. 


Solution. We wish to compute the components of H* given in Eq. ( 5.31). For the computer 
program we have the transfer function of Eq. (5.34), which in terms of z is 
U K K 
De) = 22. = Av, LE 
EG) 1l-z 2-1 


Using Eq. (5.24), we get the Laplace-transform form 


eT 
D*(s) = -7 (5.35) 
For the plant and zero-order-hold we require 
1 -Ts G t — -Ts a 
(1 —e7"*)(G(s)/s)* = (1 —e77) [Lv TE 
1 1 . 
= (l-e ™){-— , 
( die; G S 2) 
Using Eq. (5.5), we have 
* - l 1 
(1 —e-7*)(G(s)/s)* = (1 - e) (<i zd ive) i 
Because e^?" = i this reduces to 
, (1/2)e7"* 
1 — e7")(G(s)/s)* = ————— 
( MG(/s)" = P 
1/2 : 
-—————. 5.36 
e™ — 1/2 (939) 
Combining Eq. (5.36) and Eq. (5.35) then, in this case, we obtain 
K e 
H*(s) = 2$ — — — ——., 5.37 
(e) 2 Geer 172) Gen 


Equation (5.37) can now be used in Eq. (5.32) to find the closed-loop transfer function from 
which the dynamic and static responses can be studied, as a function of K,, the program gain. 
We note also that beginning with Eq. (5.25), we can readily calculate that 


D (leg 


Y(s) = R* 
G) Dx 


G(s). (5.38) 


* 
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Equation (5.38) shows how to compute the response of this system in between 
sampling instants. For a given r(t), the starred terms in Eq. (5.38) and the 
(1 — e77*)-term correspond to a train of impulses whose individual values can be 
computed by expanding in powers of e^^*. These impulses are applied to G(s)/s, 
which is the step response of the plant. Thus, between sampling instants, we will 
see segments of the plant step response. 

With the exception of the odd-looking forward transfer function, Eq. (5.32) 
looks like the familiar feedback formula: forward-over-one-plus-feedback. Un- 
fortunately, the sequence of equations by which Eq. (5.32) was computed was 
a bit haphazard, and such an effort might not always succeed. Another example 
will further illustrate the problem. 


€ Example 5.3 


Figure 5.14 

A simple system that 
does not have a transfer 
function 


Another Block Diagram Analysis 
Compute Y*and Y for the block diagram of Fig. 5.14. 


Solution. The equations describing the system are (all symbols are Laplace transforms) 


E=R-Y, (a) 
U=HE, (b) (5.39) 
Y-U'G; (o) 


and after sampling, the equations are 


E* 2 R* = Y’, (a) 
UHE): (b) (5.40) 
Ye -—-UsG (c) 


How do we solve these? In Eq. (5.40(b)) we need E, not E*. So we must go back to Eq. 
(5.39(a)) 


U* =(H(R — Y)* 
-(HRy — (HY). 


Using Eq. (5.39(c)) for Y, we have 
U* = (HRY - (HU*G)*. 
Taking out the periodic U* in the second term on the right gives 


U* = (H RY — U*(HG)*. 
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Solving, we get 


e m R)* 
1+ (HG)* Gati 
From Eq. (5.40(c)), we can solve for Y* 
iow, OA 
= TLUGY Gr. (5.42) 
+ 


Equation (5.42) displays a curious fact. The transform of the input is bound 
up with H (s) and cannot be divided out to give a transfer function! This system 
displays an important fact that with the manipulations of stars for sampling might 
be overlooked: A sampled-data system is time varying. The response depends 
on the time relative to the sampling instants at which the signal is applied. Only 
when the input samples alone are required to generate the output samples can 
we obtain a discrete transfer function. The time variation occurs on the taking 
of samples. In general, as in Fig. 5.14, the entire input signal r(t) is involved in 
the system response, and the transfer-function concept fails. Even in the absence 
of a transfer function, however, the techniques developed here permit study of 
stability and response to specific inputs such as step, ramp, or sinusoidal signals. 

We need to know the general rules of block-diagram analysis. In solving 
Fig. 5.14 we found ourselves working with U, the signal that was sampled. This 
is in fact the key to the problem. Given a block diagram with several samplers, 
always select the variables at the inputs to the samplers as the unknowns. Being 
sampled, these variables have periodic transforms that will always factor out 
after the sampling process and result in a set of equations in the sampled (starred) 
variables that can be solved. 


@ Example 5.4 


Figure 5.15 
A final example for 
transfer-function analysis 


of sampled-data systems 


Another Block Diagram Analysis 


Compute the transforms of Y* and Y for the block diagram drawn in Fig. 5.15. 


Solution. We select E and M as independent variables and write 
E(s) = R ~ M'*G,, 
M(s) = E*HG,. (5.43) 
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Next we sample these signals, and use the “if periodic, then out” rule from Eq. (5.19): 
E* = R* — M*G} 
M* = E*(HG,)". (5.44) 
We solve these equations by substituting for M* in Eq. (5.44) from Eq. (5.43) 
E* = R* — E*(HG,)*G3 
R’ 


= ——___—__.. 5.45 
1+(HG,)*G} DIEM 
To obtain Y we use the equation 
SEH: 
R°H 
= — 5.46 
1+ (HG)*G5 quo 
and 
RH? 
Y = ———_—___ (5.47) 


~ 1+ (HG,)°G" 
In this case we have a transfer function. Why? Because only the samples of the external input 
are used to cause the output. To obtain the z-transform of the samples of the output, we would 
let e7 =z in Eq. (5.47). From Eq. (5.46) we can solve for the continuous output, which 
consists of impulses applied to H (s) in this case. 


* 


Asa final example of analysis of sampled-data systems we consider a problem 
of experimental transfer function measurement in a sampled-data system. 


€ Example 5.5 


Figure 5.16 


Measuring the Transfer Function of a Sampled-Data System 


It has been proposed to use an experiment to measure the loop gain of a trial sampled-data 
design on the actual physical system using the setup of Figure 5.16. The proposal is to have 
zero reference input but to inject a sinusoid into the system at W and to measure the responses 


A block diagram for experimental measurement of a sampled-data transfer function 


Controller 
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Figure 5.17 

A block diagram for 
experimental 
measurement of a 
sampled-data transfer 
function with sampled 
input 


. Compute the transforms of E, and E, for a general signal input at w. 


at that frequency at locations E, and E,. It is thought that the (complex) ratio of these signals 
will give the loop gain from which the gain and phase margins can be determined and with 
which a frequency response design can be worked out. 


. Suppose that the signal w is a sinusoid of frequency w, less than 2/T (no aliasing). Plot 


the spectra of GW and (GW)* and show that (GW)* — +GW at the frequency wy. 


when wy < 2/T. 


and held before being applied to the system. 


Solution. 


signals as follows 


Solving Eq. (5.52) for U* 


Controller 


E,—-W-U'H 
E,=U*H 

Y = WG + U*HG 

Y* = (WG) + U*(HGY 
U*--py 


D*(WG)* 


U* (s) MEN NER T TT omm 
1 4- D'(HG) 


T 
RN 
ES 


H 


. Use the results of 2) to get an expression for the complex ratio of the signals E, and E, 


. Repeat these calculations for the setup of Fig. 5.17 where the input signal is first sampled 


1. Following the procedure just given, we express the signals of interest in terms of sampled 


(5.48) 
(5.49) 
(5.50) 
(5.51) 
(5.52) 


(5.53) 
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If we now substitute this result into Eq. (5.48) and Eq. (5.49) we have the solution of this 
part as 
D*(WG)* 
1+ D'(HGY 
= D*(WG)* 
21+ D*(HG)" 

Clearly we do not have a transfer function since the transform of the signal is imbedded 
in the signal transforms. 


E =W- 


(5.54) 


. For the second part, we can consider the sinusoid one exponential at a time and consider 
w = 2z (wv — wp). Then 
| &S 2z 2nk 
GW) = iw — ik —)2zó(wv — EE 
(GW) r2. Ge jk Y2n8(o — wy — =) 
The spectra involved are easily sketched. Since w, < z/ T there is no overlap and at w 
the signal is 


l 
(GW)* = T G(jog)2x6(w — wy) 


(5.55) 


. If we substitute Eq. (5.55) into Eq. (5.54) and take the ratio, we find the describing 
function 
l 
E —D*GH 
2s-— T ______ (5.56) 
1 1+ D*(GH)* — pO GH 


Notice that if [G| = 0 for |w| > x/T so that G* = G for frequencies less than x/T , 
then Eq. (5.56) reduces to 


E —--—D'(GH)', 

E, 

which is the transfer function. Thus the proposed method works well if there is a good 
antialias filter in the loop. 

. With the input applied through asample and hold as drawn in Fig. 5.17 the key expressions 
are given by 


E, = U*H + W*H 


E, = U*H 
U* = DE? 
Y = U*HG + W* HG. (5.57) 
These equations can be readily solved, after taking the "star" of Y to give 
W'H 
1 T+ DUGY 
D*(HG)* z 


w STED HOR 
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From these, the ratio gives the true discrete transfer function 


E, 
— = —D*(HG)*. 
E, 


5.5 Calculating the System Output Between 
Samples: The Ripple 


In response to a particular input, the output of a sampled-data system at sam- 
pling instants can be computed by the z-transform, even in those cases where 
there is no transfer function. However, in many problems it is important to 
examine the response between sampling instants, a response that is called the 
*ripple" in the response. Often, for example, the maximum overshoot will not 
occur at a sampling instant but at some intermediate point. In other cases, hid- 
den oscillations are present, oscillations that may or may not decay with time. 
The ripple is generated by the continuous-time part of the system at the out- 
put. For example, in the case drawn in Fig. 5.13, the ripple is given by the 
response of G(s)/s between sampling instants. Three techniques have been sug- 
gested to compute ripple. The first, suggested by J. Sklansky, is based on the 
partial-fraction expansion of G(s)/s. The second, suggested by E. Jury, is based 
on introducing a time shift in the sampler at the output of the system. If this 
shift is less than a sample period, the new samples are taken between the sys- 
tem samples. The modified transform from input samples to shifted samples is 
called the modified z — transform of G(s)/s. The third technique, introduced 
by G. Kranc, is based on sampling the output at a faster rate than the feedback 
loop is updated. Block diagrams representing the three methods are given in 
Fig. 5.18. 

In the partial-fraction method, shown in Fig. 5.18(a), the outputs of the sev- 
eral fractions are sampled and the values of y, (kT), y,(kT), ... are computed in 
the regular way with z — transforms or MATLABstatements. These values at the 
instant kT represent initial conditions for the several partial fraction continuous 
dynamics at time K T and from them the transient over the period from KT to 
(k + 1)T can be computed. The total system output is the sum of these compo- 
nents. The method is somewhat tedious but gives an exact expression for the ripple 
during any given sample period from which, for example, the peak overshoot can 
be exactly computed. 

The modified z — transform of the plant with zero-order hold is defined as 

G(z, m= (=z {SO onl 0 € m « 1, 
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Figure 5.18 
Three methods used to evaluate ripple. (a) Partial fraction expansion; (b) Modified 
z-transform; (c) Multirate sampling 


(a) 


—o y(kT + mT) 


(b) 


—w(kT/n) 


(c) 


and represents samples taken at the times kT + mT. The modified transform of 
the output of the system shown in Fig. 5.18(b) is given by 


_ D()G(z, m) 
Yen = 5 DOG 


and its inverse will give samples at KT + mT. The modified operation is noncausal 
but is only being used as a computational device to obtain inter-sample ripple. 
For example, if m = 0.5 then use of Eq. (5.58) will find sample values halfway 
between sample updates of the control. MATLAB only permits delays (causal 
models) and can be used to find the output of the modified plant delayed by one 
sample shown in the figure as z~! Y (z, m). If the plant is given in state form with 
description matrices (F, G, H, J), then the representation of the delayed modified 
transform can be computed in MATLAB using 


SYS = ss(F, G, Hj J). 


R(z), (5.58) 


The delay for sample period 7 and shift m is set by the command 


set(SYS,'td', (1 — m)T), 
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5.6 


5.7 


and finally, the discrete representation of the system which has a delayed modified 
z — transform is given by the standard conversion 


SYSD— G20 (XS, Tj. 


The method of multi-rate sampling is shown in Fig. 5.18(c). The output of 
the controller is held for the full T seconds but this signal is again sampled at 
the rate 7/n for some small n, such as 5. The plant output is also sampled at the 
rate T/n. The feedback loop is unchanged by these additional samplers but the 
output ripple is now available at n points in between the regular sample times. 
This technique is readily programmed in MATLAB and is regularly used in this 
book to compute the ripple. An interesting case is given in Fig. 7.14 where it can 
be seen that the maximum overshoot occurs in the ripple. 


Summary 


In this chapter we have considered the analysis of mixed systems that are partly 
discrete and partly continuous, taking the continuous point of view. These systems 
arise from digital control systems that include A/D and D/A converters. The 
important points of the chapter are 


The large-signal behavior of an A/D converter can be modeled as a linear 
impulse modulator followed by a zero-order-hold. 

D/A converter can be modeled as a zero-order-hold. 

The transform of a sampled signal is periodic with period 27/7 for sample 
period T. 

Sampling introduces aliasing, which may be interpreted in both the frequency 
and the time domains. 

The sampling theorem shows how a band-limited signal can be reconstructed 
from its samples. 

Interconnections of systems that include sampling can be analyzed by block- 
diagram analysis. 

If the input signal to a sampled-data system is not sampled, it is impossible 
to define a transfer function. 

The output of a sampled-data system between sampling instants can be com- 
puted using partial fraction expansion, using the modified z — transform, 
or by multi-rate sampling. With a computer, multi-rate sampling is the most 
practical method. 


Problems 


S.1 Sketch a signal that represents bounded hidden oscillations. 


Figure 5.19 
Impulse response of a 
first-order hold filter 


Figure 5.20 

Block diagram of a 
sample and first-order 
hold 


Figure 5.21 


Asampled-data system 


5.2 


5.3 


5.4 


5.5 
5.6 


5.7 


5.7 Problems 183 


Show how to construct a signal of hidden oscillations that grows in an unstable fashion. 
Where in the s-plane are the poles of the transforms of your signal(s)? 


A first-order hold is a device that extrapolates a line over the interval from kT to (k + 1)T 
with slope given by the samples r(kT — T) and r(kT) starting from r(kT) as shown in 
Fig. 5.19. Compute the frequency response of the first-order hold. 


Consider the circuit of Fig. 5.20. By plotting the response to a signal that is zero for all 
sample instants except = 0 and thatis 1.0 at ¢ = 0, show that this circuit implements a 
first-order hold. 


Sketch the step response y(t) of the system shown in Fig. 5.21 for K = H, l, and 2. 


Sketch the response of a second-order hold circuit to a step unit. What might be the major 
disadvantage of this data extrapolator? 


A triangle hold is a device that has an output, as sketched in Fig. 5.22 that connects the 
samples of an input with straight lines. 


(a) Sketch the impulse response of the triangle hold. Notice that it is noncausal. 
(b) Compute the transfer function of the hold. 
(c) Use MATLAB to plot the frequency response of the triangle hold. 


IT 2T 3T 4T 


+ 
r (2) 4 


EE 
—M 
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Figure 5.22 
Response of a sample 
and triangle hold 


5.8 


5.9 


5.10 


5.11 


5.12 


5.13 


5.14 
5.15 


5.16 


(d) How would the frequency response be changed if the triangle hold is made to be 
causal by adding a delay of one sample period? 


Sketch the output of a sample and zero-order hold to 

(a) A step input. 

(b) A ramp input. 

(c) A sinusoid of frequency w,/10. 

Sketch the output of a sample and first-order hold to 

(a) A step input. 

(b) A ramp input. 

(c) A sinusoid of frequency c, /10. 

Sketch the output of a sample and triangle hold to 

(a) A step input. 

(b) A ramp input. 

(c) A sinusoid of frequency c, /10. 

Sketch the output of a sample and causal triangle hold to 

(a) A step input. 

(b) Aramp input. 

(c) A sinusoid of frequency c, /10. 

A sinusoid of frequency 11 rad/sec. is sampled at the frequency w, = 5 rad/sec. 

(a) Indicate the component frequencies up to w = 20 rad/ sec. 

(b) Indicate the relativeamplitudes of the components up to 20 rad/ sec . if the sampler 
is followed by a zero-hold. 

A signal r(t) = sin(2t) + sin(15t) is sampled at the frequency w, = 16. 

(a) Indicate the frequency of the components in the sampled signal up to w = 32. 


(b) Indicate the relative amplitudes of the signals in the output if the signal is passed 


through the anti-aliasing filter with transfer function qam before sampling. You 
: Lj 
can use MATLAB to compute the filter gain. 


Derive Eq. (5.38). 


Find the transform of the output Y (s) and its samples Y* (s) for the block diagrams shown 
in Fig. 5.23. Indicate whether a transfer function exists in each case. 


Assume the following transfer functions are preceded by a sampler and zero-order hold 
and followed by a sampler. Compute the resulting discrete transfer functions. 


Figure 5.23 

Block diagrams of 
sampled data systems. 
(a) Single loop; 

(b) multiple loop 


Figure 5.24 
Block diagrams showing 
the modified z-transform 


5.7 Problems 185 


5.17 


(a) :G; (3) =a/s* 

(b) G,() = e^! /(s +1) 

(c) G,(s) = I/s(s +1) 

(d) G,(s) = e^ 5'/s(s + 1) 

(9 G,(s) = 1/(s? — 1) 

One technique for examining the response of a sampled data system between sampling 


instants is to shift the response a fraction of a period to the left and to sample the result. 
The effect is as shown in the block diagram of Fig. 5.24 and described by the equation 


Y*(s,m) = R*(s)(G(s)e""5}*. 
As a function of z, the equivalent equation is 
Y(z) = R(z)G(z, m). 


The function G(z, m) is called the modified z — transform of G(s). In the figure, let 


1 
= ——,  R(s)=-,and T=}. 
(s +1) (s) 5 ad 


(a) Compute y(t) by calculating y(kT) from the ordinary z — transform and 
observing that between samples, the output y(t) is an exponential decay with 


G(s) 


R R* Y a 
x: te 


y(t) y(t + mT) 
Os=m<l 
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unit time constant. Sketch the response for five sample intervals. Notice that this 
technique is the essence of the partial-fraction method of obtaining the ripple. 


(b) Compute the modified z — transform for m = i and compute the samples 
according to the equation for Y (2, m). Plot these on the same plot as that of y(t) 
and verify that you have found the values at the mid-points of the sampling pattern. 


5.8 Appendix 


To compute the transform of a sinusoid, we consider first the Fourier transform 
of v(t) = e/*;'*?. For this we have 


V(jo) = ih gÜcrtidy e jo qu (5.59) 
This integral does not converge in any obvious way, but we can approach it from 
the back door, as it were. Consider again the impulse, ó(1). The direct transform 
of this object is easy, considering the sifting property, as follows 


f b(the “dt =1. 


oo 


Now the general form of the inverse Fourier transform is given by 


Jd es ata i FCGoye/"do. 
QI «as 


oo 


If we apply the inverse transform integral to the impulse and its transform, we 
take f(t) = &(r) and F(jw) = 1 with the result 


TP fira 
ó(t) = =| eda. 
2704 J =o 
However, except for notation and a simple change of variables, this is exactly 
the integral we needed to evaluate the spectrum of the single exponential. If we 
exchange ¢ with w the integral reads 


| UP hese 
ô = — gdje 
ea) 2x i E 


oo 


Eq. (5.59) is of this form 


oo 
V(jo) =f RT 
—oo 


oo 
= at | eie- qt 


oo 


= 2ne/*8(m — w,). 


At the last step in this development, the sign of the argument in the delta 
function was changed, which is legal because 5(t) is an even function and 6(t) = 
6(—t). The argument is more natural as (w — w,) rather than the opposite. 


6 


Discrete Equivalents 


Figure 6.1 

Magnitude of a 
low-frequency bandpass 
filter with a narrow 
rejection band 


A Perspective on Computing Discrete Equivalents 


One of the exciting fields of application of digital systems! is in signal process- 
ing and digital filtering. A filter is a device designed to pass desirable signal 
components and to reject undesirable ones; in signal processing it is common to 
represent signals as a sum of sinusoids and to define the “desirable components” 
as those signals whose frequencies are in a specified band. Thus a radio receiver 
is designed to pass the band of frequencies transmitted by the station we want 
to hear and reject all others. We would call a filter which does this a bandpass 
filter. In electrocardiography it often happens that power-line frequency signals 
are strong and unwanted, so a filter is designed to pass signals between ] and 500 
Hz but to eliminate those at 60 Hz. The magnitude of the transfer function for 
this purpose may look like Fig. 6.1 ona log-frequency scale, where the amplitude 
response between 59.5 and 60.5 Hz might reach 107°. Here we have a band-reject 
filter with a 60-dB rejection ratio in a 1-Hz band centered at 60 Hz. 

In long-distance telephony some filters play a conceptually different role. 
There the issue is that ideal transmission requires that all desired frequencies be 


20 log | HI t 


1 Including microprocessors and special-purpose devices for digital signal processing, called DSP chips. 
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emulation 


treated equally but transmission media—wires or microwaves—introduce distor- 
tion in the amplitude and phase of the sinusoids that comprise the desired signal 
and this distortion must be removed. Filters to correct the distortion are called 
equalizers. Finally, the dynamic response of control systems requires modifica- 
tion in order for the complete system to have satisfactory dynamic response. We 
call the devices that make these changes compensators. 

Whatever the name—filter, equalizer, or compensator—many fields have use 
for linear dynamic systems having a transfer function with specified character- 
istics of amplitude and phase. Increasingly the power and flexibility of digital 
processors makes it attractive to perform these functions by digital means. The 
design of continuous electronic filters is a well-established subject that includes 
not only very sophisticated techniques but also well-tested computer programs 
to carry out the designs [Van Valkenburg (1982)]. Consequently, an important 
approach to digital filter design is to start with a good analog design and con- 
struct a filter having a discrete frequency response that approximates that of the 
satisfactory design. For digital control systems we have much the same motiva- 
tion: Continuous-control designs are well established and one can take advantage 
of a good continuous design by finding a discrete equivalent to the continuous 
compensator. This method of design is called emulation. Although much of our 
presentation in this book is oriented toward direct digital design and away from 
emulation of continuous designs with digital equivalents, it is important to un- 
derstand the techniques of discrete equivalents both for purposes of comparison 
and because it is widely used by practicing engineers. 


Chapter Overview 


The specific problem of this chapter is to find a discrete transfer function that 
will have approximately the same characteristics over the frequency range of 
importance as a given transfer function, H(s). Three approaches to this task are 
presented. The first method is based on numerical integration of the differential 
equations that describe the given design. While there are many techniques for 
numerical integration, only simple formulas based on rectangular and trapezoid 
rules are presented. The second approach is based on comparisons of the s and 
z domains. Note that the natural response of a continuous filter with a pole at 
some point s — s, will, when sampled with period T, represent the response of a 
discrete filter with a pole at z = e°”. This formula can be used to map the poles 
and zeros of the given design into poles and zeros of an approximating discrete 
filter. This is called pole and zero mapping. The third and final approach is based 
on taking the samples of the input signal, extrapolating between samples to form 
an approximation to the signal, and passing this approximation through the given 
filter transfer function. This technique is called hold equivalence. The methods 
are compared with respect to the quality of the approximation in the frequency 
domain as well as the ease of computation of the designs. 


6.1 
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Design of Discrete Equivalents via Numerical 
Integration 


The topic of numerical integration of differential equations is quite complex, 
and only the most elementary techniques are presented here. For example, we 
only consider formulas of low complexity and fixed step-size. The fundamental 
concept is to represent the given filter transfer function H (s) as a differential 
equation and to derive a difference equation whose solution is an approximation 
of the differential equation. For example, the system 


ES) dito 
Ry sa (6.1) 


is equivalent to the differential equation 


u+au = ae. (6.2) 


Now, if we write Eq. (6.2) in integral form, we have a development much like 
that described in Chapter 4, except that the integral is more complex here 


u(t) =} [—au(t) + ae(t)]dt, 
0 


kT-T kT 
u(kT) = Í [—au + ae]dt «f [—au + ae]ldt 
0 


kT-T 


2 area of —au + ae 
=waT-1) +] over kT -T € t <kT. (6.3) 


Many rules have been developed based on how the incremental area term is 
approximated. Three possibilities are sketched in Fig.6.2. The first approximation 
leads to the forward rectangular rule” wherein we approximate the area by the 
rectangle looking forward from kT — T and take the amplitude of the rectangle 
to be the value of the integrand at kT — T. The width of the rectangle is T. The 
result is an equation in the first approximation, u, 


u, (kT) =u, (kT — T) + T[-au (kT — T) + ae(kT — T)] 
= (1 — aT)u (kT — T) - aTe(kT — T). (6.4) 


The transfer function corresponding to the forward rectangular rule in this case 
is 


His EE 
AUCI ae 
a 
= — forward rectangular rule). 6.5 
Coie, - Ne» 


2 Also known as Euler’s rule. 
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Figure 6.2 

Sketches of three ways 
the area under the curve 
from kT to kT +T can 
be approximated: 

(a) forward rectangular 
rule, (b) backward 
rectangular rule, 


(0 trapez oid rule (a) Forward Rectangular Rule 


(b) Backward Rectangular Rule 


(c) Trapezoid Rule 


A second rule follows from taking the amplitude of the approximating 
rectangle to be the value looking backward from kT toward kT — T, namely, 
—au(kT) + ae(kT). The equation for u,, the second approximation,” is 


u(kT) = u,(kT — T) + T[-au(kT) + ae(kT)] 


u,(kT — T) aT 
mi el eT): 6.6 
l+aT eare ) (60) 


3 It is worth noting that in order to solve for Eq. (6.6) we had to eliminate (X7) from the right-hand 
side where it entered from the integrand. Had Eq. (6.2) been nonlinear, the result would have been an 
implicit equation requiring an iterative solution. This topic is the subject of predictor-corrector rules, 
which are beyond our scope of interest. A discussion is found in most books on numerical analysis. See 
Golub and Van Loan (1983). 
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Again we take the z-transform and compute the transfer function of the backward 
rectangular rule i 


aT 1 aTz 
Ha = Fo E 
l+aTl—z'/(+aT) 2zl+aT)-1 
——— (backward rectangular rule). (6.7) 
(z—-1)/Tz+a 


Our final version of integration rules is the trapezoid rule found by taking the 
area approximated in Eq. (6.3) to be that of the trapezoid formed by the average 
of the previously selected rectangles. The approximating difference equation is 


a(kT) =u,(kT —T) + slau (kT E 
+ae(kT — T) — au,(kT) t ae(kT)] (6.8) 
.,1-(8T/2) aT/2 " 
— GT (aT /2) T T)+ 1+ (aT/D [é,(eT — T) + e (XT)]. 


The corresponding transfer function from the trapezoid rule is 


aT (z 4- 1) 


H. m es 
Mc raDscal —2 
a 


z Q/T)[(z — D/(z - D] +a (trapezoid rule). — (6.9) 


Suppose we tabulate our results obtained thus far. 


H(s) Method Transfer function 
2 a 
pares Forward rule a eat 
2 Backward rule D a US: (6.10) 
ans 8 (z—D/Tza 
a . J B 
Eu Trapezoid rule H- Se ee 


From direct comparison of H (s) with the three approximations in this tabu- 
lation, we can see that the effect of each of our methods is to present a discrete 
transfer function that can be obtained from the given Laplace transfer function 
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H (s) by substitution of an approximation for the frequency variable as shown 
below 


Method Approximation 
z—]1 
Forward rule se T 
a (6.11) 
Backward Rule $ < 
Tz 
; 2z—1 
Trapezoid Rule Ses 
Tzt+l 


The trapezoid-rule substitution is also known, especially in digital and 
sampled-data control circles, as Tustin’s method [Tustin (1947)] after the British 
engineer whose work on nonlinear circuits stimulated a great deal of interest in 
this approach. The transformation is also called the bilinear transformation 
from consideration of its mathematical form. The design method can be summa- 
rized by stating the rule: Given a continuous transfer function (filter), H(s), a 
discrete equivalent can be found by the substitution 

H,(z) = H()l ai (6.12) 

Each of the approximations given in Eq. (6.11) can be viewed as a map from 
the s-plane to the z-plane. A further understanding of the maps can be obtained 
by considering them graphically. For example, because the (s = jw)-axis is the 
boundary between poles of stable systems and poles of unstable systems, it would 
be interesting to know how the jw-axis is mapped by the three rules and where 
the left (stable) half of the s-plane appears in the z-plane. For this purpose we 
must solve the relations in Eq. (6.11) for z in terms of s. We find 


De z Lus, (forward rectangular rule). 
ii) 2 = ToT’ (backward rectangular rule). (6.13) 
1+ Ts/2 
il) z = EET (bilinear rule). 


If we let s = jw in these equations, we obtain the boundaries of the regions 
in the z-plane which originate from the stable portion of the s-plane. The shaded 
areas sketched in the z-plane in Fig. 6.3 are these stable regions for each case. 
To show that rule (ii) results in a circle, E is added to and subtracted from the 
right-hand side to yield 


(6.14) 
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Figure 6.3 

Maps of the left-half of the s-plane by the integration rules of Eq. (6.10) into the 
z-plane. Stable s-plane poles map into the shaded regions in the z-plane. The unit circle 
is shown for reference. (a) Forward rectangular rule. (b) Backward rectangular rule. 

(c) Trapezoid or bilinear rule 


Unit circle 


(a) (b) (c) 
Now it is easy to see that with s = jw, the magnitude of z — i is constant 


l 
Iz zl =F 
and the curve is thus a circle as drawn in Fig. 6.3(b). Because the unit circle 
is the stability boundary in the z-plane, it is apparent from Fig. 6.3 that the 
forward rectangular rule could cause a stable continuous filter to be mapped into 
an unstable digital filter. 

It is especially interesting to notice that the bilinear rule maps the stable 
region of the s-plane exactly into the stable region of the z-plane although the 
entire jq@-axis of the s-plane is compressed into the 2x -length of the unit circle! 
Obviously a great deal of distortion takes place in the mapping in spite of the 
congruence of the stability regions. As our final rule deriving from numerical 
integration ideas, we discuss a formula that extends Tustin’s rule one step in an 
attempt to correct for the inevitable distortion of real frequencies mapped by the 
rule. We begin with our elementary transfer function Eq. (6.1) and consider the 
bilinear rule approximation 


H(z) = 


a 
Q/TYG — )0/G-- 1) +a 


The original AZ (s) had a pole at s = —a, and for real frequencies, s = jo, the 
magnitude of E (jc) is given by 


IH Go) = 


a 
w He 
m 
E e fat FT 
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Thus our reference filter has a half-power point, IH? = i, at w = a. It will be 
interesting to know where H, (z) has a half-power point. 

As we saw in Chapter 4, signals with poles on the imaginary axis in the s- 
plane (sinusoids) map into signals on the unit circle of the z-plane. A sinusoid of 
frequency w, corresponds to z, = e/^*. and the response of H T, (3) to a sinusoid 
of frequency w, is H,(z,). We consider now Eq. (6.8) for H, (2, 5 and manipulate 
it into a more convenient form for our present purposes 


2 ei^? —] 
H,(z)-— Ze serit) 


T e^? 41 
ei%T/2  g-jeT 

a Soo eem el^: T/2 ED e j^ T/2 +a) 

=a/(+ jun" +a). (6.15) 

The magnitude squared of H, will be į when 
2 i wT 
= tan = 
T 2 
or 
T T 

tan 2i- = T (6.16) 


Equation (6.16) is a measure of the frequency distortion or warping caused by 
Tustin's rule. Whereas we wanted to have a half-power point atw = a, we realized 
a half-power point at w, = (2/ T) tan ! (aT/2). w, will be approximately correct 
only ifaT/2 « 1 so that tan! (aT/2) = aT/2, that i is, if w, (= 2z / T) > a and 
the sample rate is much faster than the half-power frequency. We can turn our 
intentions around and suppose that we really want the half-power point to be at 
w,. Equation (6.16) can be made into an equation of prewarping: If we select 
a according to Eq. (6.16), then, using Tustin's bilinear rule for the design, the 
half-power point will be at w,. A statement of a complete set of rules for filter 
design via bilinear transformation with prewarping is 


1. Write the desired filter characteristic with transform variable s and critical 
frequency c, in the form H (s/o) ^ 
2. Replace w, by a such that 


4 The critical frequency need not be the band edge. We can use the band center of a bandpass filter or the 
crossover frequency of a Bode plot compensator. However, we must have w, < 7/7 if a stable filter is 
to remain stable after warping. 
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and in place of H(s/w,), consider the prewarped function H (s/a). For more 
complicated shapes, such as bandpass filters, the specification frequencies, 
such as band edges and center frequency, should be prewarped before the 
continuous design is done; and then the bilinear transformation will bring all 
these points to their correct frequencies in the digital filter. 

3. Substitute 

2z—1 

Pari 

in H(s/a) to obtain the prewarped equivalent H (2). 


As a frequency substitution the result can be expressed as 


s 

H,(z) = H(—) ; (6.17) 
€, t= —l_ zat 

w tan(w, 7/2) z+1 

It is clear from Eq. (6.17) that when w = w,, H,(2,) = H(jl) and the discrete 

filter has exactly the same transmission at w, as the continuous filter has at 

this frequency. This is the consequence of prewarping. We also note that as the 


sampling period gets small, H(z) approaches H(jw/w,). 


€ Example 6.1 


Computing a Discrete Equivalent 


The transfer function of a third order low-pass Butterworth filter? designed to have unity pass 
bandwidth (o, = l)is 

l 
H2 42541 
A simple frequency scaling would of course translate the design to have any desired passband 
frequency. Compute the discrete equivalents and plot the frequency responses using the forward 


rectangular rule, the backward rectangular rule, the Tustin bilinear rule and the bilinear rule 
with prewarping at w = 1. Use sampling periods T = 0.1, T = 1, and T = 2. 


H(s) = 


Solution. Computation of the discrete equivalents is numerically tedious and the state-space 
algorithms described below were used in MATLAB to generate the transfer functions and the 
response curves plotted in Fig. 6.4. Fig 6.4(a) shows that at a high sample rate (T = 0.1), 
where the ratio of sampling frequency to passband frequency is w Jo, = 63, all the rules do 
reasonably well but the rectangular rules are already showing some deviation. From Fig. 6.4(b) 
we see that at w/w, = 27 the rectangular rules are useless (the forward rule is unstable). 
Finally, in Fig. 6.4(c) at very slow sampling frequency with wlw, = x corresponding to a 
sampling period of T = 2 sec, only with prewarping do we have a design that comes even close 
to the continuous response. In each case at the Nyquist frequency, w = x /T . the magnitude 


5 A description of the properties of Butterworth filters is given in most books on filter design and briefly 
in Franklin, Powell and Emami-Naeini (2019). 
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Figure 6.4 

(a) Response of 
third-order lowpass filter 
and digital equivalents 
for w/w, = 207. 

(b) Response of 
third-order lowpass filter 
and digital equivalents 
for w/w, = 27. 

(c) Response of 
third-order lowpass filter 
and digital equivalents 
for w,/w, = 7 
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responses of the discrete filters start to repeat according to the periodic nature of discrete- 
transfer-function frequency responses. It can be seen that the magnitude and phase of the 
prewarped designs match those of the continuous filter exactly at the band edge, w = 1, for all 
these cases. This is no surprise, because such matching was the whole idea of prewarping. 


* 


The formulas for discrete equivalents are particularly simple and convenient 
when expressed in state-variable form and used with a computer-aided design 
package. For example, suppose we have a vector-matrix description of a contin- 
uous design in the form of the equations 


X = Ax + Be, 
u = Cx + De. (6.18) 
The Laplace transform of this equation is 
sX = AX + BE, 
U =CX + DE. (6.19) 


We can now substitute for s in Eq. (6.19) any of the forms in z corresponding to 
an integration rule. For example, the forward rectangular rule is to replace s with 
(z — 1)/T from Eq. (6.11) 


U = CX + DE. (6.20) 
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In the time domain, the operator z corresponds to forward shift; that is, zx(k) = 
x(k + 1). Thus the corresponding discrete equations in the time domain are 
x(k + 1) — x(k) = TAx(k) + TBe(k), 
x(k + 1) = (I+ TA)x(k) + TBe(k), 

u = Cx + De. (6.21) 

Equation (6.21) is a state-space formula for the forward rule equivalent. 
For the backward rule, substitute s «— (z — 1)/zT with the result 
2] 


X = AX + BE, 
z 
which corresponds to the time domain equations 
x(k +1) — x(k) = TAx(k + 1) + TBe(k + 1). (6.22) 


In this equation, there are terms in k + 1 on both the right- and left-hand sides. 
In order to get an equation with such terms only on the left, transpose all k + 1 
terms to the left and define them as a new state vector 


x(k + 1) — TAx(k + 1) — TBe(k + 1) = x(k) 


2 wk + D). (6.23) 
From this equation, solving for x in terms of w and e 
(I—AT)x = w+ TBe 
x = (I— AT) 'w - (I— AT) ?BTe. (6.24) 
With this expression for x, Eq. (6.23) can be put in standard form as 
wk + 1) = (I — AT)“'w(k) + (I - AT) BTe(&), (6.25) 
and the output equation is now 
u(k) = C(I - AT)*'w + (D + C(I - AT)'!BTje. (6.26) 


Equation (6.25) plus Eq. (6.26) are a state-space description of the backward rule 
equivalent to Eq. (6.18). 
Finally, for the trapezoid or bilinear rule, the z-transform equivalent is ob- 


tained from 
2(2-—1 
————É—X =AX+BE 
T(z 4-1) 
AT BT 
(z-1)X= Eu +1)X+ EC -DE 
U = CX + DE, (6.27) 


and the time domain equation for the state is 


x(k 4-1) — x(k) = AF lk +1)+x(k)) + PT elk +1)+e(k)). (6.28) 
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Once more, collect all the k + 1 terms onto the left and define these as w(k + 1) 
as follows® 


AT BT AT BT 
x(k + 1) — —-x( + 1) = elk + I) = x) + - X0 + Sel 
= /Twk + 1). (6.29) 


Writing the definition of w at time k, solve for x as before 


AT es AT\~' BT 
x= (1- +) Vwa (1- 57) se (630 


Substituting Eq. (6.30) into Eq. (6.29), we obtain 


J/Tw(k4-1) = (1+ +) (1 = +) [rw + Fel gn BT etk) 


2 2 
= 
w(k+1)= (45) (1- 27) w(k) 
AT ATA BVT 


AT ary ATN als 
= (1482) (1 8" «o + (1- 8T) ave 
(6.31) 


In following this algebra, it is useful to know that in deriving the last part of 
Eq. (6.31), we expressed the identity I as (I — re! — aD)" and factored out 
(I — r2 on the right. 
To obtain the output equation for the bilinear equivalent, we substitute Eq. 
(6.30) into the second part of Eq. (6.27): 
-l -I 
u(k) = VTC (1 - 25) w(k) + [v+ C ( - ) = e(k). 


These results can be tabulated for convenient reference. Suppose we have a 
continuous system described by 


x(t) = Ax(t) + Be(t), 
u(t) = Cx(r) + De(t). 


6 The scale factor of VT is introduced so that the gain of the discrete equivalent will be balanced between 
input and output, a rather technical condition. See Al Saggaf and Franklin (1986) for many more details. 
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6.2 


Then a discrete equivalent at sampling period T will be described by the equations 


w(k + 1) = ®w(k) + Fe(k), 
u(k) = Hw(k) + Je(k), 


where ®, I', H , and J are given respectively as follows: 


Forward Backward Bilinear 
b %I+AT (I— AT)'BT (I+ Af) - 477! 
r BT (I - AT)" (I— At)"'BYT 
C(I - AT)! VTC- à.) 
J D D-c-C-AT)'BT D-cCü-*€)'BTry2 


The MATLAB Control Toolbox provides for the computation of Tustin bi- 
linear equivalents with the function c2d. The syntax of computing the bilinear 
equivalent SYSD of a continuous system SYS at sampling period Ts is 


SYSD = c2d(SYS,Ts, ‘tustin’) (6.32) 


If ‘tustin’ is replaced with ‘prewarp’, the bilinear equivalent with prewarping is 
computed. 


Zero-Pole Matching Equivalents 


A very simple but effective method of obtaining a discrete equivalent to a contin- 
uous transfer function is to be found by extrapolation of the relation derived in 
Chapter 4 between the s- and z-planes. If we take the z-transform of samples of 
a continuous signal e(t), then the poles of the discrete transform F(z) are related 
to the poles of E(s) according to z = e*’. We must go through the z-transform 
process to locate the zeros of E(z), however. The idea of the zero-pole matching 
technique is that the map z = e?" could reasonably be applied to the zeros also. 
The technique consists of a set of heuristic rules for locating the zeros and poles 
and setting the gain of a z-transform that will describe a discrete, equivalent 
transfer function that approximates the given H(s). The rules are as follows: 


1. All poles of H(s) are mapped according to z = eT. If H(s) has a pole at 
s = —a, then H,,(z) has a pole at z = e~*" If H(s) has a pole at —a + jb, 
then H,, (z) has a pole at re^?, where r = e" and 6 = bT. 


2. All finite zeros are also mapped by z = e'*. If H(s) has a zero at s = —a, 
then H, (z) has a zero at z = e ^^^, and so on. 
3. The zeros of H (s) at s = oo are mapped in H,,(z) to the point z = —1. The 


rationale behind this rule is that the map of real frequencies from jw = 0 to 


€ Example 6.2 
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increasing c is onto the unit circle at z = e”? = | until z = e/” = —1. Thus 


the point z = — 1 represents, in a real way, the highest frequency possible 
in the discrete transfer function, so it is appropriate that if H (s) is zero at 
the highest (continuous) frequency, |H, 2) should be zero at z = —1, the 
highest frequency that can be processed by the digital filter. 


(a) If no delay in the discrete response is desired, all zeros at s = oo are 
mapped to z = —1. 
(b) If one sample period delay is desired to give the computer time to com- 
plete the output calculation, then one of the zeros at s = co is mapped 
to z — oo and the others mapped to z — —1. With this choice, H, (2) 
is left with a number of finite zeros one fewer than the number of finite 
poles. 
4. The gain of the digital filter is selected to match the gain of H (s) at the band 
center or a similar critical point. In most control applications, the critical 
frequency is s = 0, and hence we typically select the gain so that 


HG). = HO 


A more complete discussion is contained in Section 8.3.3 in Franklin, Powell, 
Emami- Naeini, 7th or 8th edition. 


A Zero-pole Matching Equivalent 


Compute the discrete equivalent to 


a 
H(s)= 
(s) XE. 
by zero-pole matching. 
Solution. The pole of H (s) at s 2 —a will mapto a pole of H (z) at e^?" Thezeroat s = co 
will map to a zero at z = —1. The gain of H(s) at s = 0 is 1. To match this gain in H(z) at 
z — | requires gain of —— The final function is given by 
(z 4 (1 — 67") 
ime (6.33) 
or, using rule 3(b), the result is 
ic e? 
e 


As with the rules based on numerical analysis, an algorithm to generate the 
matched zero-pole equivalent is also readily constructed. In MATLAB, a matched 
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Figure 6.5 


6.3 


zero-pole equivalent, SYSD, at sample period Ts to the continuous system, SYS, 
is given by 


SYSD = c2d(SYS, Ts, matched’). 


The frequency response of the matched zero-pole equivalent for the third-order 
Butterworth filter of Example 6.1 is plotted in Fig. 6.9 along with that of other 
equivalents for purposes of comparison. 


Hold Equivalents 


For this technique, we construct the situation sketched in Fig. 6.5. The samplers in 
Fig. 6.5(b) provide the samples at the input of H,, (z) and take samples at its output 
insuring that H,,(z) can be realized as a discrete transfer function. The philosophy 
of the design is the following. We are asked to design a discrete system that, with 
an input consisting of samples of e(t), has an output that approximates the output 
ofthe continuous filter H (s) whose input is the continuous e(t). The discrete hold 
equivalent is constructed by first approximating e(t) from the samples e(k) with 
a hold filter and then putting this e,(r) through the given H (s). There are many 
techniques for taking a sequence of samples and extrapolating or holding them to 
produce a continuous signal.’ Suppose we have the e(t) as sketched in Fig. 6.6. 
This figure also shows a sketch of a piecewise constant approximation to e(t) 


System construction for hold equivalents. (a) A continuous transfer function. (b) Block 
diagram of an equivalent system. 


e(t) | HG) | u(t) 


(a) 


H mua ) 
(b) 


7 Some books on digital-signal processing suggest using no hold at ali, using the equivalent 
H(z) = Z{H(s)}. This choice is called the z-transform equivalent. 


Figure 6.6 


A signal, its samples, and 


its approximation by a 
zero-order hold 


6.3.1 
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e(t) a(t) 


obtained by the operation of holding e, (f) constant at e(k) over the interval from 
kT to (k + 1)T. This operation is the zero-order hold (or ZOH) we've discussed 
before. If we use a first-order polynomial for extrapolation, we have a first-order 
hold (or FOH), and so on for higher-order holds. 


Zero-Order Hold Equivalent 


If the approximating hold is the zero-order hold, then we have for our 
approxima-tion exactly the same situation that in Chapter 4 was analyzed as a 
sampled-data system.® Therefore, the zero-order-hold equivalent to H(s) is 
given by Eq. (4.41), which is 
H 
Ho) = 0-2) Z ae | (6.35) 


S 


€ Example 6.3 


A Hold Equivalent 


Find the zero-order-hold equivalent to the first-order transfer function 


Hm 


Solution. The partial fraction expansion of the s-plane terms of Eq. (6.35) is 


H(s) _ a Zl l 
s ~ s(s+a) s s+a 
and the z-transform is 
H 1 I 
z[*9 = HEFTE (6.36) 
S S s+a 


8 Recall that we noticed in Chapter 5 that the signal ê is, on the average, delayed from e by T/2 sec. The 


size of this delay is one measure of the quality of the approximation and can be used as a guide to the 
selection of 7.. 


204 Chapter6 Discrete Equivalents 


6.3.2 


Figure 6.7 

Impulse response of the 
extrapolation filter for 
the triangle hold 


and, by definition of the operation given in Eq. (6.36) 


H oo oo 
Z l S | = Y - 3 ate 
$ 0 0 
T 1 
s (= 27! l Sen hN 
_a sera D MU deis 
^ Q-z5ü-ez7 c 
Substituting Eq. (6.37) in Eq. (6.35), the zero-order-hold equivalent of H (s) is found as 
(1 — 6777) 


Ayo (2) = "a. Lee (6.38) 


£=% 
* 


(6.37) 


We note that for the trivial example given, the zero-order-hold equivalent of 
Eq. (6.38) is identical to the matched zero-pole equivalent given by Eq. (6.34). 
However, this is not generally true as is evident in the comparison with frequency 
responses of other equivalents for the third-order Butterworth filter example 
plotted in Fig. 6.9. Because a sample and zero-order hold is an exact model for 
the sample and hold with A/D converter used in the majority of discrete systems, 
we have already seen the computation of this equivalent in MATLAB as 


SYSD = c2d(SYS, Ts,'zoh’) 


where the continuous system is described by SYS and the sample period is Ts. 


A Non-Causal First-Order-Hold Equivalent: The 
Triangle-Hold Equivalent 
An interesting hold equivalent can be constructed by imagining that we have a 


noncausal first-order-hold impulse response, as sketched in Fig. 6.7. The result 
is called the triangle-hold equivalent to distinguish it from the causal first-order 
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hold. The effect of the triangle hold is to extrapolate the samples so as to connect 
sample to sample in a straight line. Although the continuous system that does 
this is noncausal, the resulting discrete equivalent is causal. 
The Laplace transform of the extrapolation filter that follows the impulse 
sampling is 
eT -p ae er 
m ` 
Therefore the discrete equivalent that corresponds to Eq. (6.35) is 


(z — 1)? H(s) 
H, (e) = Tz Z E. (6.39) 
€ Example 6.4 A Triangle-Hold Equivalent 
Compute the triangle-hold equivalent for H (s) = 1/s?. 
Solution. In this case, from the tables of z-transforms 
ET MEER 
OE S 
3/002 
4 
-" qe (o d E DE (6.40) 
6 (z — 1) 
and direct substitution into Eq. (6.39) results in 
(z — 1? T? (e +424 1)z 
HG) = CIE CT NE, 
Tz 6 (z — 1) 
2.2 
4 
sT sel. (6.41) 
6 (2-1) 
* 


An alternative, convenient way to compute the triangle-hold equivalent is 
again to consider the state-space formulation. The block diagram is shown in 
Fig. 6.8. The continuous equations are 

x = Fx + Gv, 
Ue Ae, 
w — u(t 4- T)à(t + T) — 2u(t)d(t) -- u(t — T)d(t — T), (6.42) 


Figure 6.8 x 
Block diagram of the g eT! 247i sad id i ” »| H() 
triangle-hold equivalent T 
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and, in matrix form, 


x FG 0 x 0 
ò [2| 0 0 1/T v |+| 0 fa (6.43) 
w 00 0 w 1 


where 4 represents the input impulse functions. We define the large matrix in Eq. 
(6.43) as F,, and the one-step solution to this equation is 


G(T + 1) = e™TG(kT) 


because u consists only of impulses at the sampling instants. If we define 


bor T 
exp(F,7) = ; i ; (6.44) 


then the equation in x becomes 
x(k + 1) = x(k) — T v(k) + C, w(k). 


With care, the last two equations of Eq. (6.42) can be integrated to show that 
v(k) = u(k) and that w(k) = u(k + 1) — u (k). If a new state is defined as E(k) = 
x(k) — P,u(&), then the state equation for the triangle equivalent is 
E(k +1) = B(E(k)  P,u()) + (7, — Puck) 
= Pk) + (T, + 9T, — T, )u(k). (6.45) 


The output equation is 
y(k) = Hx(&) + Ju(k) 


= H (E(k) +P u(k)) + Juck) 
= HE(k) + (J + HT ,)u (k). (6.46) 


Thus the triangle equivalent of a continuous system described by [F, G, H, J] 
with sample period T is given by 


A- d, 

B =T, +4r, -F,, 

C=H, 

D=J+ Hb, (6.47) 


where 6, I’, , and P, are defined by Eq. (6.44). In the MATLAB Control Toolbox, 
the function c2d will compute the triangle-hold equivalent (referenced there as a 
first-order-hold equivalent) of continuous system SYS by 


SYSD = c2d(SYS, Ts,'foh’). 
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Figure 6.9 

Comparison of digital 
equivalents for sampling 
period (a) T = 1 and 
w/w, = 2x and 

(b) T=2andw,/o, =x 
where ZOH = 0, 
zero-pole = +, and 
triangle = x 


Normalized Frequency w/w, 


(b) 


In Fig. 6.9 the frequency responses of the zero-pole, the zero-order hold, 
and the triangle-hold equivalents are compared again for the third-order Butter- 
worth lowpass filter. Notice in particular that the triangle hold has excellent phase 
responses, even with the relatively long sampling period of T = 2, which corre- 
sponds to a sampling frequency to passband frequency ratio of only c, /o, =g 
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6.4 Summary 


In this chapter we have presented several techniques for the construction of dis- 
crete equivalents to continuous transfer functions so that known design methods 
for continuous systems can be used as a basis for the design of discrete systems. 
The methods presented were 


1. Numerical integration 


(a) Forward rectangular rule 

(b) Backward rectangular rule 

(c) Trapezoid, bilinear, or Tustin’s rule 

(d) Bilinear transformation with prewarping 


2. Zero-pole matching 
3. Hold equivalents 


(a) Zero-order-hold equivalent 
(b) Noncausal first-order- or triangle-hold equivalent 


All methods except the forward rectangular rule guarantee a stable discrete 
system from a stable continuous prototype with the provision that the warping 
frequency of the bilinear transformation with prewarping must be less than the 
Nyquist frequency of | rad/sec. Zero-pole matching is the simplest method 
to apply computationally if the zeros and poles of the desired filter are known 
and takes advantage of the known relations between response and poles and 
zeros. This is one of the most effective methods in the context of an overall 
design problem and in later chapters the zero-pole matching method is frequently 
selected. With a reasonable computer-aided-design tool, the designer can select 
the method that best meets the requirements of the design. The MATLAB function 
C2d computes the discrete description for most of these discrete equivalents from 
a continuous system described by SYS with sample period 7s as follows. 


SYSD =  c2d(SYS, Ts, method) where 
method = ‘zok’ zero-order hold 
method = ‘foh’ first-order hold (triangle hold) 
method =  -'tustin' Tustin's bilinear method 
method =  'prewarp' bilinear with prewarping 


method = ‘matched’ zero-pole matching 
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6.5 Problems 


6.1 


6.2 
6.3 


6.4 


Sketch the zone in the z-plane where poles corresponding to the left half of the s-plane 
will be mapped by the zero-pole mapping technique and the zero-order-hold technique. 
Show that Eq. (6.15) is true. 
The following transfer function is a lead network designed to add about 60° phase lead 
at w = 3 rad/sec 

51 
Q.1s 1. 


(a) For each of the following design methods compute and plot in the z-plane the pole 
and zero locations and compute the amount of phase lead given by the equivalent 
network at z, — ei^? if T = 0.25 sec and the design is via 


H(s) = 


i. Forward rectangular rule 

ii. Backward rectangular rule 

iii. Bilinear rule 

iv. Bilinear with prewarping (use w, as the warping frequency) 
v. Zero-pole mapping 

vi. Zero-order-hold equivalent 

vii. Triangular-hold equivalent 


(b) Plot over the frequency range w, = 0.1 — w, = 100 the amplitude and phase Bode 
plots for each of the above equivalents. 

The following transfer function is a lag network designed to increase K, by a factor of 

10 and have negligible phase lag at w, = 3 rad/sec. 

10s +1 

1 00s+ 1° 

(a) For each of the following design methods, compute and plot on the z-plane the 
zero-pole patterns of the resulting discrete equivalents and give the phase lag at 
z; = e/"1" corresponding to w, = 3 rad/sec. Let T = 0.25 sec. 


H(s) = 10 


jd Forward rectangular rule 
ii. | Backward rectangular rule 
iii. Bilinear rule 


iv. Bilinear with prewarping (Use w, = 3 rad/sec as the warping frequency) 


v. Zero-pole matching 
vi. Zero-order-hold equivalent 
vii.  Triangle-hold equivalent 


(b) For each case computed, plot the Bode amplitude and phase curves over the range 
w,= 0.01 — w,= 10 rad/sec. 
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Design Using Transform 
Techniques 


A Perspective on Design Using Transform Techniques 


The idea of controlling processes that evolve in time is ubiquitous. Systems 
from airplanes to the national rate of unemployment, from unmanned space ve- 
hicles to human blood pressure, are considered fair targets for control. Over a 
period of three decades from about 1930 until 1960, a body of control theory was 
developed based on electronic feedback amplifier design modified for servomech- 
anism problems. This theory was coupled with electronic technology suitable for 
implementing the required dynamic compensators to give a set of approaches 
to solve control problems now often called classical techniques to distinguish 
these methods from designs based on a state-space formulation which came to be 
called modern techniques. The landmark contributors to this “classical” theory 
are Evans (1950) [root locus] and Nyquist (1932) and Bode (1945) [frequency 
response]. For random inputs, the work of Wiener (1948) should be added. The 
unifying theme of these methods is the use of Fourier and Laplace transforms to 
represent signals and system dynamics and to describe the control specifications. 
Controller design is then carried out in the selected transform domain. From the 
perspective of the 90's the terms “classical” and “modern” seem a bit pejorative 
and we prefer to classify the methods as transform techniques and state-space 
techniques. 

The methods based on transforms were developed before computers were 
available and the engineer had to depend on hand calculations and careful hand 
plotting to achieve the design. The availability of computers and software such as 
MATLAB have made calculations and plotting simple, fast and accurate; and now 
the hand-plotting guidelines are used as verification of the automatic calculations 
and as a guide to design decisions. In this role, the understanding of the design 
process gained by the experience of doing a simple design by hand is well 
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design by emulation 


design by root locus 


design by frequency 
response 


7.1 


worth the effort spent in developing the required skills. The introduction of 
digital control and sampled data adds new constraints and new possibilities to 
the transform design methods. The z-transform is added to the Laplace and the 
Fourier transforms and poles and zeros have meaning relative to the unit circle 
rather than to the imaginary axis. The meaningful part of the frequency response 
is restricted to half the sampling frequency. Each of these developments must be 
understood in order to apply transform methods to digital control. 


Chapter Overview 


Building on previous understanding of the design of continuous systems, the first 
method for digital design is based on emulation of a continuous design. The 
continuous controller is simply replaced with a digital equivalent computed by 
using one of the techniques described in Chapter 6. The result may be evaluated 
in terms of poles and zeros in the z-plane, magnitude and phase in the frequency 
response, or transient response to step, impulse or other input. 

The second method introduced is the root locus where it is demonstrated 
that the rules of the root locus are unchanged from the continuous case but the 
relations between pole location and time response must refer to the z-plane rather 
than the s-plane. 

Finally, the Nyquist stability criterion for discrete systems is developed and 
Bode’s design methods for gain and phase margins are extended to discrete 
systems. In addition to the usual results, the concept of system sensitivity is 
developed to show how frequency response can be used to cause the system to 
be robust with respect to both stability and performance when the plant transfer 
function is subjected to bounded but unknown perturbations. 


System Specifications 


We first consider the design specifications that the controller is expected to 
achieve. As reviewed in Chapter 2, the central concerns of controller design 
are for good transient and steady-state response and for sufficient robustness. 
Requirements on time response and robustness need to be expressed as constraints 
on s-plane pole and zero locations or on the shape of the frequency response in 
order to permit design in the transform domains. Dynamic performance in the 
time domain is defined in terms of parameters of system response to a step in 
command input. The most frequently used parameters are the rise time, ż,; the 
settling time, ¢,; the percent overshoot, M; and the steady-state error, e,.. These 
parameters, which apply equally well to discrete control as to continuous control, 
are discussed in Section 4.1.7. The s-plane expressions of these requirements are 
summarized by the following guidelines: 


velocity constant 


Figure 7.1 


A unity feedback system 
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e The requirement on natural frequency is 
c, 2 1.8/t,. (7.1) 
e The requirement on the magnitude of the real part of the pole is 


|Re{s,}| =o = £c, > 4.6/1,. (7.2) 
* The fractional overshoot, M,, is given in terms of the damping ratio, £, 
by the plot of Fig. 2.7 which can be very crudely approximated by 


ç ~ 0.6(1 — M,). (7.3) 


The specifications on steady-state error to polynomial inputs is determined 
by the error constant appropriate for the case at hand as described in Section 4.2.2. 
The most common case is for systems of Type 1, which is to say, systems that 
have zero steady-state error to a step input and finite error to a ramp input of slope 
rg of size e, = r,/K, where K, is the velocity constant. For a single-loop system 
with unity feedback gain and forward transfer function D(s)G(s) as shown in 
Fig. 7.1, the system is Type 1 if DG has a simple pole at s = 0. 


* The velocity constant is then given by 


K 2 FOGGY. 
id e 520 


ss 


The fact that in discrete systems the control is applied as a piecewise constant 
signal causes a roughness in the response that is directly related to the sample 
frequency. A specification on roughness leads to a specification on sample pe- 
riod, 7. This parameter is so important to the design of discrete controls that 
Chapter 11 is devoted to the decision. At this point, let it suffice to point out 
that the smaller the 7, the better the approximation to continuous control and the 
smaller the roughness. 


e A reasonable choice of T is one that results in at least 6 samples in the 
closed-loop rise time and better, smoother control results if there are more 
than 10 samples in the rise time. 
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€ Example 7.1 


Selection of Sample Period 
What is the relation between sampling frequency and system natural frequency if there are 10 


samples in a rise time? 


Solution. The sampling frequency in radians/sec is given by w, = 27: / T and we assume that 
rise time and natural frequency are related by Eq. (7.1) so that 


w, = 1.8/1, 

_ 18 

10T 

Substituting for T, we find that 

0.18%, 

Sa = 21 

or 

w, 8: w,/35. 


In other words, the sample rate, w,, should be 35 times faster than the natural frequency, c,. 


7.2. 


* 


From this example, we conclude that typically the sample frequency should 
be chosen to be 20 to 35 times the closed loop natural frequency. Slower sampling 
can be used but one would expect the resulting transients to be excessively rough. 

Robustness is the property that the dynamic response (including stability 
of course) is satisfactory not only for the nominal plant transfer function used 
for the design but also for the entire class of transfer functions that express 
the uncertainty of the designer about the dynamic environment in which the real 
controller is expected to operate. A more comprehensive discussion of robustness 
will be given when design using frequency response is considered. For root locus 
design, the natural measure of robustness is, in effect, gain margin. One can 
readily compare the system gain at the desired operating point and at the point(s) 
of onset of instability to determine how much gain change is acceptable. 


* A typical robustness requirement is that one should have gain margin of two 
so that the loop gain must double from the design value before reaching the 
stability boundary. 


Design by Emulation 
The elements of design by emulation have been covered already. Continuous 


control design is reviewed in Chapter 2, and in Chapter 6 the techniques of 
computing discrete equivalents are described. Control design by emulation is 
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mainly a combination of these two ideas. A controller design is done as if the 
system is to be continuous and, after a sample period is selected, a discrete 
equivalent is computed and used in place of the continuous design. This discrete 
controller may then be simulated and tested in the discrete control loop and 
modifications made, if necessary. 


Discrete Equivalent Controllers 


Techniques to compute discrete equivalents are described in general terms in 
Chapter 6, and their performance is illustrated on the basis of filter frequency 
responses. In this chapter, we are interested in controllers for feedback control 
and in performance comparisons on the basis of time responses. Any of the 
techniques from Chapter 6 can be used for the purpose; here we illustrate the 
use of the pole-zero mapping equivalent and explore the choice of sample period 
by example. An alternative approach that considers directly the performance for 
the discrete controller in the feedback context has been described by Anderson 
(1992). The method described in that reference leads to a multirate sampling 
problem of the sort which will be considered in Chapter 11. 


€ Example 7.2 


Figure 7.2 
Block diagram of the 
plant transfer function 


Design of Antenna Servo Controller 


A block diagram of the plant for an antenna angle-tracker is drawn in Fig. 7.2. The transfer 


function is given by 
l 
mi s(10s -- 1)" 


The specifications for this system are 
1. Overshoot to a step input less than 1696 
2. Settling time to 1% to be less than 10 sec 


3. Tracking error to a ramp input of siope 0.01 rad/sec to be less than 0.01 rad 
4. Sampling time to give at least 10 samples in a rise-time 


Design a controller for this system using the method of emulation. 


Solution. From the specifications one can estimate the acceptable region in the s-plane for 
the closed loop poles. From the overshoot requirement, we conclude that the damping ratio 
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Figure 7.3 

Acceptable pole 
locations for the 
antenna control 


must be ¢ > 0.5. From the settling time requirement, we conclude that the roots must have a 


real part of o. > 4.6/10 = 0.46. Finally, from the steady-state error requirement, we conclude 
that the velocity constant is constrained to be K, > ga = 1.0. Based on the limits on the 


damping ratio and the real-part of the poles, we can sketch the acceptable region for closed- 
loop poles in the s-plane as done in Fig. 7.3. Using lead compensation to cancel a piant pole, 
a first choice for controller might be 


10s + 1 
s+1 ` 


The root locus for this choice is drawn in Fig. 7.4 using MATLAB commands to enter the plant 
as a system, ant, the compensation as lead1, and the product as the open loop system, sysol. 


D(s) = (7.4) 


np= 1; 

dp = [10 1 0]; 

ant = tf(np,dp); 

nc = [10 1]; 

de= [1 1]; 

lead1 = tf(nc,dc); 
sysol = lead *ant; 


rlocus(sysol) 


Figure 7.4 

Root locus for 
compensated antenna 
model 
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The locations of the roots with K = 1 corresponding to a velocity constant of K, = | are 
marked by the dots computed by 


p = rlocus(sysol,1.0). 


The natural frequency for the given pole locations is essentially w, = 1, which corresponds to 
a rise time of t, = 1.8 sec . The indicated sampling period is thus T = t /10 = 0.18. A value 
of T — 0.2 will be used for this example and a value of T — 1.0 illustrated later to dramatize 
the effects of the choice of T. The compensation, D(s), given by Eq. (7.4), has two first-order 
factors; the zero is at s = —0.1, and the pole is at s = —1. The pole-zero mapping technique 
requires that each singularity is mapped according to z = e" ; therefore, we take D(z) of the 
form 
D()- K Í—À., 
z— pi 


and place a zero at 
a 909002 — 0.9802, 
and a pole at 
p, =e VO = 08187. 
To make the de gain of D(z) and D(s) be identical, we require that 
de gain = lim D(z) = lim D(s) 21 


| 1-— 0.9802 


——— a4 ae 
1 — 0.8187 V 
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Solving for K we have 
K — 9.15, 


and the design of the discrete equivalent compensation has the transfer function 
— 0.9802 


Z 
D(z) = 9.15 ————_. 7. 
i z — 08187 ps 
To compute this result in MATLAB, the command is 
leadid = c2d(lead1,0.2,’matched’). 
* 


€ Example 7.3 


Implementing the Controller 
Give the difference equation that corresponds to the D(z) given by Eq. (7.6). 
Solution. The transfer function is converted into a difference equation for implementation 


using the ideas developed in Chapter 4. Specifically, we first multiply top and bottom by z^! 
to obtain 


U (z) 1 — 0.98022! 
= 2 ~ 915 ———. 
DG) 213 T COPIE 


Ez) 
which can be restated as 


(1 — 0.8187 z7')U(z) = 9.15 (1 — 0.9802 z^! ) E (2). 


The z-transform expression above is converted to the difference equation form by noting that 
z^! represents a l-cycle delay. Thus 


u(k) = 0.8187 u(k — 1) + 9.15 (e(k) — 0.9802 e(k — 1)). 


This equation can be directly evaluated by a computer. 


1.2.23. 


Evaluation of the Design 


A description of a digital controller that is expected to satisfy the specifications 

for the antenna controller is now complete. A block diagram of the sampled-data 

system with discrete controller is given in Fig. 7.5. To analyze the behavior of 

this compensation, we first determine the z-transform of the continuous plant 
(Fig. 7.2) preceded by a zero-order hold (ZOH). 
z-1 a 

bp = £t hs (7.7) 


& 
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Figure 7.5 
Block diagram of sampled-data system 


which is 


—] ] 1 1 1 
oe = z|- 2 |e 
Z S7 as as-ca 
Using the tables in Appendix B, we find 
z= Tz z lez 
G(z) = oy ap t=} 
z tos a(z—1 az-e" 
T Az + B 
alz — l)e — e77)" 


where 
A=" +aT-1l, Ra Ter —aTe", 
For this example, with T = 0.2 and a = 0.1, this evaluates to 
z + 0.9934 
(z — 1)(z — 0.9802) ` 
Of course, this can be computed in MATLAB as the discrete model of the antenna 
by 


G(z) = 0.00199 (7.8) 


antd = c2d(ant,0.2). 


With the transfer function of the plant-plus-hold and the discrete controller, 
we can obtain the system difference equation and compute the step response, 
obviously most easily using a computer aided design package. The steps are 


sysold = lead1d * antd 

syscld = connect(sysold, [1 — 1]) 

step(syscld). 
In this case, the step response of the system with the discrete controller is shown 
in Fig. 7.6. The figure confirms that the discrete controller will perform satisfac- 
torily, albeit with somewhat increased overshoot. This simulation was carried out 


using the linear, discrete model of the system. As mentioned earlier, simulations 
can be embellished with the important nonlinearities such as friction and with 
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Figure 7.6 
Step response of the 5 
Hz controller 


Output, y and Control, 4/10 


0 2 4 6 8 10 12 14 16 18 20 
Time (sec) 


computation delays in order to assess their effects in addition to the effect of the 
discretization approximations. 


@ Example 7.4 


Antenna Servo with Slow Sampling 


Repeat the antenna design with a sample rate of 1 Hz (T = | sec); in this case the sample rate 
is approximately two samples per rise time. 


Solution. Repeating the calculations as in Eq. (7.7) with T = | sec results in 


z + 0.9672 
G(z) = 0.0484 ——————————-. 7:9 
e (z— I)e — 0.9048) qu) 
Furthermore, repeating the calculations that led to Eq. (7.6) but with T = | sec, we obtain 
z — 0.9048 
D(z) = 6.64 —————. k 
v z — 03679 CSI 


A plot of the step response of the resulting system is shown in Fig. 7.7 and shows substantial 
degradation of the response as a result of the slow sampling. A partial explanation of the extra 
overshoot can be obtained by looking at the Bode plot of the continuous design, computed with 
bode(sysol) and plotted in Fig. 7.8. The designed phase margin in the continuous system is seen 
to be 51.8?. As was indicated in Chapter 4, the sample and hold can be roughly approximated 
by a delay of 7/2 sec. At the crossover frequency of c, = 0.8 rad, and with sampling at 
T = 0.2, this corresponds only to ¢ = 0T = 4.5°. However, at T = 1.0, the sample-hold 
delay corresponds to $ = 23°. Thus the effective phase margin with a sample and hold is 
reduced to Pm = 51.8? — 23° = 28.8°. With this small phase margin, the effective damping 
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Figure 7.7 
Step response of the 
1-Hz controller 


: y with discrete controller 


Outpur, y and Control, u/10 
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Time (sec) 


Figure 7.8 10! 
Bode plot of the 

continuous design for 
the antenna control 10? 


107! 


Magnitude 


10°! 10° 10! 


-150 


Phase, degrees 


-180 


-200 | : w. 
107! 109 10! 
w (rad/sec) 


ratio is about 0.29 and the overshoot is expected to be about 0.4 rather than 0.16 as designed. 
The step response shows the actual M, — 0.5, so most of the extra overshoot is explained by 
the sample-hold delay. 


* 
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1.3 


7.3.1 


The examples presented here illustrate only a small selection of the alterna- 
tives for design by emulation. An immediate improvement would be expected if 
the continuous design were to include at the outset the characteristic T/2 delay of 
the sample and zero-order hold. Other than this modification, the other algorithms 
for discrete equivalent design can be tried. These include the very simple Euler 
rectangular rules, the bilinear transformations, and the several hold-equivalent 
methods. The triangle hold equivalent appears to be especially promising.’ There 
does not seem to be a dominant technique that is best for every case. The designer 
needs to explore alternatives based on the particular system, the required perfor- 
mance specifications and the practical constraints introduced by the technology 
to be used for implementation to guide the final choice. Here we now turn to 
consider the direct discrete design methods, beginning with design by use of the 
root locus in the z-plane. 


Direct Design by Root Locus in the z-Plane 


The root locus introduced by W. Evans is based on graphical rules for plotting 
the roots of a polynomial as a parameter is varied. The most common root locus 
is a plot of the roots of a closed-loop characteristic polynomial in the s-plane as 
the loop gain is varied from 0 to co. In linear discrete systems also the dynamic 
performance is largely determined by the roots of the closed-loop characteristic 
polynomial, in this case a polynomial in z with stability represented by having 
all roots inside the unit circle. The consequences for direct digital design are 
that one can use Evans’ root locus rules unchanged, but that the performance 
specifications must first be translated into the z-plane. 


z-Plane Specifications 


Figure 4.26 is a map of the unit disk in the z-plane on which is superimposed 
discrete system time responses that correspond to several typical z-plane pole 
locations. These can be used to make the translation of dynamic response per- 
formance specifications to a region of acceptable pole locations. For example, 
we have seen that rise time of a continuous second-order system is found to be 
inversely proportional to natural frequency as given by Eq. (7.1). Since poles in 
the s-plane are mapped to z = e, the natural frequency in s maps to the angle of 
the pole in polar coordinates in the z-plane as 6 = w, T where w, = =y mg Q,. 

Settling time is found to be inversely proportional to the magnitude of the real 
part of a pole in the s-plane (c) which maps to the radius of the pole in the z- 
plane as r = e? . The step response overshoot varies inversely with the damping 


] Inthe MATLAB Control Toolbox function c2d, the triangle hold is called a first-order hold in recognition 
of the fact that it is a first-order hold although it is noncausal. 


getting acceptable pole 
location in the z-piane 


Figure 7.9 

Lines of constant 
damping and natural 
frequency in the z-plane 
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ratio. Under the s-to-z mapping, lines of constant damping map into logarith- 
mic spirals in the z-plane. With these guidelines, one can readily estimate the 
dynamic response parameters based on the pole-zero pattern for simple transfer 
functions and can derive useful guidelines for design of more complex systems. 
In summary, to get the specifications on acceptable pole locations in the z-plane 


* Estimate the desired c, , ¢, and M, from the continuous-time response spec- 
ifications. Compute o = Z£o,. 


e Compute the radius r = e°", 


* Obtain a plot of the z-plane showing lines of fixed damping and w,. The 
MATLAB command zgrid will do this, plotting ¢ in steps of 0.1 from 0.1 to 
0.9 and w, = N:/10T for integer N from | to 10. An example is shown in 
Fig. 7.9. The command axis equal will cause the unit circle to be plotted as 
a circle and the command axis([-1 1 O 1]) will cause only the upper half of 
the circle to be plotted. 


* Mark the region of acceptable closed-loop pole locations on the plane. 


30 ST anT 4mll0T 
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@ Example 7.5 


Figure 7.10 
Plot of acceptable pole 
locations in the z-plane 


Z-Plane Specifications 


Indicate on a z-plane map the region of acceptable closed-loop poles for the antenna design of 
Example 7.2. 


Solution. The given specifications are that the system is to have a damping ratio of > 0.5, 
natural frequency of w, > 1, and the real-parts of the roots are to be greater than 0.5. The 
standard grid of the z-plane shows the curve corresponding to ¢ = 0.5. With the requirement 
that the roots correspond to a natural frequency greater than w, = 1, we need a plot on the 
standard grid corresponding to N = 10Tw /z = 2/m = 0.64. The last requirement means 
that the roots in the z-plane must be inside a circle of radius r < e~°°? = 0.9048. The curves 
corresponding to these criteria are marked in Fig. 7.10. 


* 


The specification of steady-state error also follows the continuous case but 
transferred to the z-plane when the controller is implemented in a computer and 
represented by its discrete transfer function D(z). The discrete transfer function 
of the plant is given by 


cQ - a-z5z [9]. (7.11) 


The closed-loop system can now be represented in a purely discrete manner. 
The discrete transfer functions of the controller, D(z), and the plant, G(z), are 


Imaginary Axis 


e 
p 


Real Axis 


discrete time final value 


discrete system type 
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combined as before according to Fig. 7.5, where it is now understood that the 
reference r and the disturbance w are sampled versions of their continuous 
counterparts. Proceeding as we did for the continuous system, suppose the input 
r is a step, r(k) = l(k), and the disturbance w is zero. The transform of the 
error is computed using the same block-diagram reduction tools that apply for 
continuous systems represented by their Laplace transforms, except that now we 
use D(z) and G(z). Doing this yields the transform of the error 


R(z) 
1+ D(z)G(z) 
— =a a ae 
~ z— 11+ D(z)G(z) 


The final value of e(k), if the closed loop system is stable with all roots of 
1 + DG —Oinside the unit circle, is, by Eq. (4.115) 


E(z)— 


. £ : 
e(00) = lim(z — DF I- PG) 
l 
~ 1+ DU)G() 
l 


= à 7.12 
[TE (7.12) 


Thus, D(1)G(1) is the position error constant, K,» of the Type 0 system in 
discrete time if the limit in Eq. (7.12) is finite. If DG has a pole at z = 1, then the 
error given by Eq. (7.12) is zero. Suppose there is a single pole at z = 1. Then 
we have a Type | system and we can compute the error to a unit ramp input, 
r(kT) - kTY(kT) as 

Tz 1 


B) —— X14 DG 


Now the steady-state error is 


E E ee 
oo) = nm — a 
à Ge Den 
Y Tz 
= lim ——————— 
Peg) T 06) 
id 
ud 743 
E (7.13) 


Thus the velocity constant of a Type | discrete system with unity feedback (as 
shown in Fig. 7.5) is 


K =lim C WURF DG) 
» zol Tz 
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which simplifies to 


et ee 


7.14 
lim Tz (7.14) 


Although it appears from Eq. (7.14) that K, is inversely proportional to 
the sample period, this is not the case if comparing for the same G(s). The 
reason is that the transfer function of G(z) computed from Eq. (7.11) is typically 
proportional to the sample period. This proportionality is exact for the very 
simple case where G(s) = 1/s, as can be seen by using Eq. (7.11) and inspecting 
Entry 4 in Appendix B.2. For systems with a finite K, and fast sample rates, this 
proportionality will be approximately correct. The result of this proportionality 
is that the de gain of a continuous plant alone preceded by a ZOH is essentially 
the same as that of the continuous plant. 


*Truxal’s Rule, Discrete Case 


Because systems of Type | occur frequently, it is useful to observe that the value 
of K, is fixed by the closed-loop poles and zeros by a relation given, for the 
continuous case, by Truxal (1955). Suppose the overall transfer function Y/R is 
H (z), and that H(z) has poles p, and zeros z,. Then we can write 


G= 7) — 2." ie) 


Hic eel po e 
(2) i —nlz-—p)-125—25 


(7.15) 


Now suppose that H(z) is the closed-loop transfer function that results from a 
Type 1 system, which implies that the steady-state error of this system to a step 
is zero and requires that 


H(1) 21. (7.16) 


Furthermore, by definition we can express the error to a ramp as 


E(2 =  R(2(1- H()) 
Tz 
— (1 - A(z)), 
ail - HU) 
and the final value of this error is given by 
Tz 


AA 
rp He 


therefore (omitting a factor of z in the numerator, which makes no difference in 
the result ) 


e(co) = lim(z =2h) 


E 


: 7.17 
ITO ug ae) ( ) 


7.3.2 
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Because of Eq. (7.16), the limit in Eq. (7.17) is indeterminate, and so we can use 
L'Hópital's rule 
l . (d/dz)(1 — H(z)) 
TK, >! (d/dz)(z — 1) 
= lim [5 E 
dz 


zl 


However, note that by using Eq. (7.16) again, at z = 1, we have 
d ld d 
— |n H(z) = —— A(z) = — H (2), 
= n H(z) Hd (z) "E (z) 
so that 
— = lim—< In H(z) 
IE = dg 49 


d | 
= lim —— {In K ————— 
lim ; L DH p) 


crc p que 

We note especially that the farther the poles of the closed-loop system are from 
z = l, the larger the velocity constant and the smaller the errors. Similarly, K " 
can be increased and the errors decreased by zeros close to z — 1. From the results 
of Chapter 4 on dynamic response, we recall that a zero close to z — 1 usually 
yields large overshoot and poor dynamic response. Thus is expressed one of the 
classic trade-off situations: We must balance small steady-state errors against 
good transient response. 


The Discrete Root Locus 


The root locus is the locus of points where roots of a characteristic equation can be 
found as some real parameter varies from zero to large values. From Fig. 7.5 and 
block-diagram analysis, the characteristic equation of the single-loop system is 


1+ D(z)G(z) = 0. (7.18) 


The significant thing about Eq. (7.18) is that this is exactly the same equation 
as that found for the s-plane root locus. The implication is that the mechanics 
of drawing the root loci are exactly the same in the z-plane as in the s-plane; 
the rules for the locus to be on the real axis, for asymptote construction, and 
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for arrival/departure angles are all unchanged from those developed for the s- 
plane and reviewed in Chapter 2. As mentioned earlier, the difference lies in 
the interpretation of the results because the pole locations in the z-plane mean 
different things than pole locations in the s-plane when we come to interpret 
system stability and dynamic response. 


€ Example 7.6 


Figure 7.11 

Root loci for antenna 
design: (a) 
Uncompensated system; 
(b) Locus with D(z) 
having the poles and 
zeros of Eq. (7.10) 


Discrete Root Locus Design 


Design the antenna system for the slow sampling case with T = ] sec. using the discrete root 
locus. 


Solution. The exact discrete model of the plant plus hold is given by the G(z) in Eq. (7.9). 
If the controller consisted simply of a proportional gain [ D(z) — K], the locus of roots versus 
K can be found by solving the characteristic equation 

z + 0.9672 m 
(z — 1)(z — 0.9048) — 
for many values of K . The result computed by rlocus(antd) is shown in Fig. 7.11 as the dashed 
arc marked (a). From study of the root locus we should remember that this locus, with two 
poles and one zero, is a circle centered at the zero (z = —0.9672) and breaking away from the 
real axis between the two real poles. 

From the root locus of the uncompensated system (Fig. 7.11(a)) it is clear that some 
dynamic compensation is required if we are to get satisfactory response from this system. The 


1 + 0.0484 K 


m | [ Im axis 


z = plane loci of roots of constant f and wa 
s--fo, t iol =F? 

zzen 

T = sampling pcriod 
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radius of the roots never gets less than 0.95, preventing the ¢, specification from being met. The 
system goes unstable at K = 2.25 [where K, —K, as can be verified by using Eq. ( 7.14)], 
which means that there is no stable value of gain that meets the steady-state error specification 
with this compensation. 

If we cancel the plant pole at 0.9048 with a zero and add a pole at 0.3679, we are using 
the lead compensation of Eq. (7.10). The root locus for this control versus the gain K [K was 
equal to 6.64 in Eq. (7.10)) computed with rlocus(sysold) is also sketched in Fig. 7.11 as the 
solid curve (b). The points, p, where K = 6.64 are computed with p = rlocus(sysold,6.64) 
and marked by dots. We can see that a damping ratio of about 0.2 is to be expected, as we 
have previously seen from the step response of Fig. 7.7. This gain, however, does result in the 
specified value of K, = 1 because this criterion was used in arriving at Eq. (7.10). The locus 
shows that increasing the gain, K, would lower the damping ratio still further. Better damping 
could be achieved by decreasing the gain, but then the criterion of steady-state error would be 
violated. It is therefore clear that this choice of compensation pole and zero cannot meet the 
specifications. 

A better choice of compensation can be expected if we transform the specifications into 
the z-plane and select the compensation so that the closed loop roots meet those values. The 
original specifications were K, > lt, < lOsec, and? > 0.5.If we transform the specifications 
to the z-plane we compute that the 1, specification requires that the roots be inside the radius 
r = e 95 = 0.61, and the overshoot requires that the roots are inside the ¢ = 0.5 spiral. The 
requirement that K, > 1 applies in either plane but is computed by Eq. (7.14) for the z-plane. 

It is typically advantageous to use the design obtained using emulation and to modify it 
using discrete design methods so that it is acceptable. The problem with the emulation-based 
design is that the damping is too low at the mandated gain, a situation that is typically remedied 
by adding more lead in the compensation. More lead is obtained in the s-plane by increasing the 
separation between the compensation's pole and zero; and the same holds true in the z-plane. 
Therefore, for a first try, let's keep the zero where it is (canceling the plant pole) and move the 
compensation pole to the left until the roots and K, are acceptable. After a few trials, we find 
that there is no pole location that satisfies all the requirements! Although moving the pole to 
the left of z = 0 will produce acceptable z-plane pole locations, the gain K, is not sufficiently 
high to meet the criterion for steady-state error. The only way to raise K, and to meet the 
requirements for damping and settling time is to move the zero to the left also. 

After some trial and error, we see that 


z — 0.80 

i cm (7.19) 
meets the required z-plane constraints for the complex roots and has a K, — 1.26. The root 
locus for Eq. (7.19) is shown in Fig. 7.12(a), and the roots corresponding to K — 6are marked 
by squares. The fact that all requirements seem to be met is encouraging, but there is an 
additional real root at z — 0.74 and a zero at z — 0.8, which may degrade the actual response 
from that expected if it were a second-order system. The actual time history is shown in 
Fig. 7.12(b). It shows that the overshoot is 29% and the settling time is 15 sec. Therefore, 
further iteration is required to improve the damping and to prevent the real root from slowing 
down the response. 

A compensation that achieves the desired result is 
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Figure 7.12 
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The damping and radius of the complex roots substantially exceed the specified limits, and 
K, = 1.04. Although the real root is slower than the previous design, it is very close to a 
zero that attenuates its contribution to the response. The root locus for all K's is shown in 
Fig. 7.13(a) and the time response for K — 13 in Fig. 7.13(b). 

Note that the pole of Eq. (7.20) is on the negative real z-plane axis. In general, placement 
of poles on the negative real axis should be done with some caution. In this case, however, no 
adverse effects resulted because all roots were in well-damped locations. As an example of 
what could happen, consider the compensation 
(z — 0.8) 
(z + 0.8) 

The root locus versus K and the step response are shown in Fig. 7.14. All roots are 
real with one root at z = —0.59. But this negative real axis root has = 0.2 and represents 
a damped sinusoid with frequency of w,/2. The output has very low overshoot, comes very 
close to meeting the settling time specification, and has K, = 1; however, the control, u, has 
large oscillations with a damping and frequency consistent with the negative real root. This 
indicates that there are “hidden oscillations" or “intersample ripple" in the output that are 
only apparent by computing the continuous plant output between sample points as is done in 
Fig. 7.14. The computation of the intersample behavior was carried out by computing it at 
a much higher sample rate than the digital controller, taking care that the control value was 
constant throughout the controller sample period. The MATLAB function ripple, included in 
the Digital Control Toolbox, has been written to do these calculations. Note that if only the 
output at the sample points had been determined, the system would appear to have very good 
response. This design uses much more control effort than that shown in Fig. 7.13, a fact that is 
usually very undesirable. So we see that a compensation pole in a lightly damped location on 
the negative real axis could lead to a poorly damped system pole and undesirable performance. 


D(z) 29 (7.21) 


+ 


In the design examples to this point, the computed output time histories 
have assumed that the control, u(k), was available from the computer at the 
sample instant. However, in a real system this is not always true. In the control 
implementation example in Table 3.1, we see that some time must pass between 
the sample of y(k) and the output of u(k) for the computer to calculate the value 
of u(k). This time delay is called latency and usually can be kept to a small 
fraction of the sample period with good programming and computer design. Its 
effect on performance can be evaluated precisely using the transform analysis of 
Section 4.4.2, the state-space analysis of Section 4.3.4, or the frequency response. 
The designer can usually determine the expected delay and account for it in the 
design. However, if not taken into account, the results can be serious as can be 
seen by an analysis using the root locus. 

Because a one-cycle delay has a z-transform of z^!, the effect of a full- 
cycle delay can be analyzed by adding z^' to the numerator of the controller 
representation. This will result in an additional pole at the origin of the z-plane. 
If there is a delay of two cycles, two poles will be added to the z-plane origin, 
and so on. 
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Figure 7.13 
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€ Example 7.7 


Effect of Unexpected Delay 


Add one cycle delay to the compensation of Eq. (7.21) and plot the resulting root locus and 
step response. 


Solution. The new controller representation is 
z — 0.88 

z(z4-0.5) 
The root locus and time response are shown in Fig. 7.15, which are both substantially changed 
from the same controller without the delay as shown in Fig. 7.13. The only difference is the 
new pole at z = 0. The severity of the one-cycle delay is due to the fact that this controller is 
operating at a very slow sample rate (six times the closed loop bandwidth). This sensitivity to 
delays is one of many reasons why one would prefer to avoid sampling at this slow a rate. 


D(z) = 13 (7.22) 


£i 


* 


Frequency Response Methods 


The frequency response methods for continuous control system design were de- 
veloped from the original work of Bode (1945) on feedback-amplifier techniques. 
Their attractiveness for design of continuous linear feedback systems depends on 
several ideas. 


]. The gain and phase curves for a rational transfer function can be easily plotted 
by hand. 


2. Ifa physical realization of the system is available, the frequency response can 
be measured experimentally without the necessity of having a mathematical 
model at all. 

3. Nyquist's stability criterion can be applied, and dynamic response specifica- 
tions can be readily interpreted in terms of gain and phase margins, which 
are easily seen on the plot of log gain and phase-versus-log frequency. 

4. The system error constants, mainly K, or K,, can be read directly from the 
low-frequency asymptote of the gain plot. 

S. The corrections to the gain and phase curves (and thus the corrections in the 
gain and phase margins) introduced by a trial pole or zero of a compensator 
can be quickly and easily computed, using the gain curve alone. 

6. The effect of pole, zero, or gain changes of a compensator on the speed of 
response (which is proportional to the crossover frequency) can be quickly 
and easily determined using the gain curve alone. 


Use of the frequency response in the design of continuous systems has 
been reviewed in Chapter 2 and the idea of discrete frequency responses has 
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been introduced in Chapter 4. In order to apply these concepts to the design of 
digital controls, the basic results on stability and performance must be translated 
to the discrete domain. The concepts are the same as for continuous systems, 
but plots of the magnitude and phase of a discrete transfer function, H(z), are 
accomplished by letting z take on values around the unit circle, z = e/*" , that is, 


magnitude = |H(z)l,;r. 
phase = IHQ),.. 


€ Example 7.8 


Discrete Bode Plot 


Plot the discrete frequency response corresponding to the plant transfer function 


G(s) 


= sis+ 1 (23) 


sampling with a zero order hold at T = 0.2, 1, and 2 seconds and compare with the continuous 
response. 


Solution. The discrete transfer functions for the specified sampling periods are computed 
with c2d.m as 

sysc = tf((1],[11 0]) 

sysd 1 =c2d(sysc,0.2) 

sysd2 = c2d(sysc, 1) 

sysd3=c2d(sysc, 2) 


with transfer functions 


z + 0.9355 
NT m EE E E 
Io e D-08187) ^ ip 

z 10.718 
Gite Oei es, EY 
i0 Gets e 

0.523 

G0) 111355 Se eee (7.24) 


(z — 1)(z — 0.135) 
The frequency responses of Eq. (7.23) and Eq. (7.24) are plotted in Fig. 7.16 using the 
statement 


bode(sysc,’ —', sysd1,’ -.’,sysd2,’:’,sysd3,’—’). 
+ 


It is clear that the curves for the discrete systems are nearly coincident with 
the continuous plot for low frequencies but deviate substantially as the frequency 
approaches 7:/T in each case. In particular, the amplitude plots do not approach 
the simple asymptotes used in the hand-plotting procedures developed by Bode, 
and his theorem relating the phase to the derivative of the magnitude curve on 


Figure 7.16 
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a log-log plot does not apply. The primary effect of sampling is to cause an 
additional phase lag. Fig. 7.17 shows this additional phase lag by plotting the 
phase difference, Ag, between the continuous case and the discrete cases. The 
approximation to the discrete phase lag given by 

wT 


^= —- (1.25) 


is also shown and demonstrates the accuracy of this approximation for sample 
rates up to wT = 2/2, which corresponds to frequencies up to 1/4 the sample 
rate. Crossover frequencies (where magnitude = 1) for designs will almost always 
be lower than 1/4 the sample rate; therefore, one can obtain a good estimate of 
the phase margin if a sample and hold is introduced into a continuous design by 
simply subtracting the factor wT/2 from the phase of the continuous design's 
phase margin. 

The inability to use the standard plotting guidelines detracts from the ease 
with which a designer can predict the effect of pole and zero changes on the 
frequency response. Therefore, points 1, 5, and 6 above are less true for discrete 
frequency-response design using the z-transform than they are for continuous 
systems and we are more dependent on computer aids in the discrete case. 
With some care in the interpretations however, points 2, 3, and 4 are essentially 
unchanged. All these points will be discussed further in this chapter as they pertain 
to design using the discrete frequency response. We begin with the discrete form 
of the Nyquist stability criterion and follow with a discussion of specifications 
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Figure 7.17 
Phase lag due to 
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of performance and stability robustness as expressed in the frequency domain 
before we introduce the design techniques directly. 


Nyquist Stability Criterion 


For continuous systems, the Nyquist stability criterion seeks to determine whether 
there are any zeros of the closed-loop characteristic equation 


L+ K D(s)G(s) 20 (7.26) 


in the right half-plane. The method establishes stability by determining the excess 
of zeros over poles of the characteristic equation in the right-half plane by plotting 
K D(s)G(s) for s along the D contour that encloses the entire right-hand side 
(unstable region) of the s-plane as sketched in Fig. 7.18(a). 

It is assumed that the designer knows the number of (unstable) poles that 
are inside the contour and from the plot can then determine the number of 
zeros of Eq. (7.26) in the unstable region that is the same as the number of 
closed-loop system unstable poles. The entire contour evaluation is fixed by 
examining K D(s)G(s) over s = jo for 0 < w < co, which is the frequency- 
response evaluation of the open-loop system. For experimental data, the plot is 
to be made for Oy LOS nigh? where Dow is small enough to allow the low- 
frequency behavior to be decided (usually the gain is very high at w,,, and the 
phase is approaching a fixed multiple of 90°), and where c, , is taken to be high 
enough that it is known that the magnitude is much less than | for all higher 
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Figure 7.18 

Contours used for Imag 
Nyquist stability criterion: 
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frequencies. Fig. 7.1 8(a) shows the full D contour and the portion of the contour 
fore, S W L Os igh’ The indentation near w = 0 excludes the (known) poles of 
K DG at s = 0 from the unstable region; the map of this small semicircle is done 


analytically by letting s = r’* forr «1,2 <@ x 5. 
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The specific statement of the Nyquist stability criterion for continuous sys- 


tems is 


Z—P- N; (7.27) 


where 


e Z =the number of unstable zeros of Eq. (7.26) (that are unstable closed-loop 


poles). For stability, Z = 0. 


e P = the number of unstable (open-loop) poles of K D(s)G(s). 
e N = the net number of encirclements of the —1 point for the contour eval- 


uation of K D(s)G(s) in the same direction as that taken by s along D as 
shown in Fig 7.18(a). Usually s is taken clockwise around D and therefore 
clockwise encirclements are taken as positive. 


For the common case of a stable open-loop system (P = 0) the closed-loop 


system is stable if and only if the contour evaluation of K D(s)G(s) does not 
encircle the —1 point. For unstable open-loop systems, the closed-loop system 
is stable if and only if the contour evaluation encircles the —1 point counter to 
the s direction as many times as there are unstable open-loop poles (N = —P 
in Eq. (7.27)). The proof of this criterion relies on Cauchy's principle of the 
argument and is given in most introductory textbooks on continuous control 
systems. The elementary interpretation is based on the following points: 


If we take values of s on a contour in the s-plane that encloses the unstable 
region, we can plot the corresponding values of the function | + K D(s)G(s) 
in an image plane. 

If the s-plane contour encircles a zero of 1 + K DG inacertain direction, the 
image contour will encircle the origin in the same direction. In the s-plane, 
the angle of the vector from the zero to s on the contour goes through 360°. 
If the s-plane contour encircles a pole of 1 + K DG, the image contour will 
encircle the origin in the opposite direction. In this case, the s-plane vector 
angle also goes through 360? but the contribution to the image angle is a 
negative 360°. 

Thus the net number of same-direction encirclements, N, equals the differ- 
ence N = Z — P? 

The origin of the 1 + K DG plane is the same as the point K DG = —1 so 
we gan plot K DG and count N as the encirclements of the — 1 point just as 
well. 


N 


It is much easier to remember same-direction and opposite-direction encirclements than to keep 
clockwise and counter-clockwise distinguished. 

When the characteristic equation is written as I + K DG, we can plot only DG and count encirclements 
of DG = -4 and thus easily consider the effects of K on stability and stability margins. 
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* From all of this, Eq. (7.27) follows immediately. 


For the discrete case, the ideas are identical; the only difference is that the 
unstable region of the z-plane is the outside of the unit circle and it is awkward to 
visualize a contour that encloses this region. The problem can be avoided by the 
simple device of considering the encirclement of the stable region and calculating 
the stability result from that. The characteristic equation of the discrete system is 
written as 


1 + K D(z)G(z) = 0, (7.28) 


and, as in the continuous case, it is assumed that the number, P, of unstable poles 
of K D(z)G(z), which are also unstable poles of 1 + K D(z)G(z), is known and 
we wish to determine the number, Z, of unstable zeros of Eq. (7.28), which 
are the unstable closed-loop poles. Examination of Eq. (7.28) reveals that the 
(possibly unknown) total number of stable plus unstable poles, n, is the same 
as the total number of zeros of Eq. (7.28). Thus the number of stable zeros is 
n — Z and the number of stable poles is n — P. Following the mapping result 
used by Nyquist, the map of 1 + K D(z)G(z) for the z contour of Fig. 7.18(b) 
will encircle the origin N times where 


N = {number of stable zeros} — {number of stable poles} 
= {n — Z} — {n — P) 
=P-Z. 


Therefore, the Nyquist stability criterion for discrete systems is 
Z=P-N. (7.29) 


In summary, the discrete Nyquist stability criterion is 


e Determine the number, P, of unstable poles of K DG. 


e Plot K D(z)G(z) for the unit circle, z = e/^" and 0 € wT x 27. This is 
a counter-clockwise path around the unit circle. Points for the plot can be 
conveniently taken from a discrete Bode plot of K DG. 


e Set N equal to the net number of counter-clockwise (same direction) encir- 
clements of the point —1 on the plot. 


e Compute Z = P — N. The system is stable if and only if Z = 0. 


€ Example 7.9 Nyquist Stability 


Evaluate the stability of the unity feedback discrete system with the plant transfer function 


G(s) (7.30) 


] 
~ s(s +1)’ 
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with sampling at the rate of 1/2 Hz or T — 2 andzero-order hold. The controller is proportional 
discrete feedback [K D(z) = K]. 


Solution. The discrete transfer function at the specified sampling rate and ZOH is given by 
Sysd3 of Example 7.8 with transfer function 


1.135(z + 0.523) 
(z — 1)(z — 0.135)’ 


and the plot of magnitude and phase of G(z) for z = e/^" is included in Fig. 7.16 for 0 < 
wT < x. Using the data from Fig. 7.16 for T = 2, the plot of K D(z)G(z) can be drawn as 
shown in Fig. 7.19. The plotis marked with corresponding points from Fig. 7.18(b) to facilitate 
understanding the results. Note that the portion from a — b — c is directly from Fig. 7.16, 
and the section from c — d — e is the same information reflected about the real axis. The 
large semicircle from e — a is the analytically drawn map of the small semicircle about z — 1 
drawn by letting (z — 1) = re"? in Eq. (7.31) forr & land-Z <@< 5. Because this system 


G(z) = (7:31) 


is open-loop stable and there are no —1 point encirclements, die conclude that the closed-loop 
system will be stable as plotted for K — 1. Note that all the necessary information to determine 
stability is contained in the Bode plot information from Fig. 7.16, which determines the portion 
from a — c in Fig. 7.19. Using MATLAB, the plot can be made by the statements 


nyquist(sysd3) 
axis equal 
grid 


The axis statement sets the x and y axes to have equal increments. 


Figure 7.19 
Nyquist plot of 
Example 7.9 
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Design Specifications in the Frequency Domain 


Gain and Phase Margins 


The Nyquist plot shows the number of encirclements and thus the stability of 
the closed-loop system. Gain and phase margins are defined so as to provide a 
two-point measure of how close the Nyquist plot is to encircling the —1 point, 
and they are identical to the definitions developed for continuous systems. Gain 
margin (GM) is the factor by which the gain can be increased before causing the 
system to go unstable, and is usually the inverse of the magnitude of D(z)G(z) 
when its phase is 180°. The phase margin (PM) is the difference between — 180° 
and the phase of D(z)G(z) when its amplitude is 1. The PM is a measure of 
how much additional phase lag or time delay can be tolerated in the loop before 
instability results. 


€ Example 7.10 


Stability Margins 
Consider the open-loop transfer function 
l 
s(s + 1)’ i 
with ZOH and sample rate of 5 Hz. The discrete transfer function is given by 
(z + 3.38)(z + 0.242) 
(z — 1)(z — 0.8187)? ` 


What are the gain and phase margins when in a loop with proportional discrete feedback 
(D(z) = K =1)? 


G(s) = 


G(z) = 0.0012 


Solution. The discrete Bode plot is given in Fig. 7.20 and the portion of the Nyquist plot 
representing the frequency responsein the vicinity of — 1 is plottedin Fig. 7.21. Unlike Example 
13.5 which had a very slow sample rate, the higher sample rate here causes the magnitude 
to be essentially zero at wT = x, and hence the Nyquist plot goes almost to the origin. The 
plot is very similar to what would result for a continuous controller. Furthermore, just as in 
the continuous case, there are no — l point encirclements if K = l as plotted (N = 0), and 
since there are no unstable poles (P = 0), the system will be stable at this gain (Z = 0). If 
the Nyquist plot is multiplied by 1.8, then the plot will go through the —1 point. Thus the gain 
margin is GM = 1.8. For values of K > 1.8, the —1 point lies within the contour thus creating 
two encirclements (N = 2) and two unstable closed-loop poles (Z = 2). As indicated on the 
plot, the angle of the plot when the gain is | is 18° from the negative axis, so the phase margin 
is 18°. 


phase margin and 
damping ratio 


* 


For continuous systems, it has been observed that the phase margin, P M, 
is related to the damping ratio, ¢, for a second-order system by the approximate 
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Figure 7.20 
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Figure 7.21 
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relation, ¢ = PM/100. This relationship is examined in Fig. 7.22 for the con- 
tinuous case and for discrete systems with two values of the sample rate. Figure 
7.22 was generated by evaluating the damping ratio of the closed-loop system 


Figure 7.22 
Damping ratio of a 
second-order system 
versus phase margin 
(PM) 
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that resulted when discrete proportional feedback was used with the open-loop 
system 


ope s(s +1) 

A z-transform analysis of this system resulted in z-plane roots that were then 
transformed back to the s-plane via the inverse of z = e^". The ¢ of the resulting s- 
plane roots are plotted in the figure. As the feedback gain was varied, the damping 
ratio and phase margin were related as shown in Fig. 7.22. The actual sample rates 
used in the figure are 1 Hz and 5 Hz, which represent 6 and 30 times the open-loop 
system pole at 1 rad/sec. The conclusion to be drawn from Fig. 7.22 is that the 
PM from a discrete z-plane frequency response analysis carries essentially the 
same implications about the damping ratio of the closed-loop system as it does 
for continuous systems. For second-order systems without zeros, the relationship 
between ¢ and P M in the figure shows that the approximation of ¢ = PM /100 is 
equally valid for continuous and discrete systems with reasonably fast sampling. 
For higher-order systems, the damping of the individual modes needs to be 
determined using other methods. 


Tracking Error in Terms of the Sensitivity Function 


The gain and phase margins give useful information about the relative stability of 
nominal systems but can be very misleading as guides to the design of realistic 
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sensitivity function 


vector gain margin 


control problems. A more accurate margin can be given in terms of the sensitivity 
function. For the unity feedback system drawn in Fig. 7.1, the error is given by 


E(jo) = R= S(jo)R, (7.32) 


1 
1+ DG 
where we have defined the sensitivity function S. In addition to being a factor of 
the system error, the sensitivity function is also the reciprocal of the distance of 
the Nyquist curve, DG, from the critical point —1. A large value for S indicates 
a Nyquist plot that comes close to the point of instability. The maximum value 
of |S| is often a more accurate measure of stability margin than either gain or 
phase margin alone. For example, in Fig. 7.23 a Nyquist plot is sketched that is 
much closer to instability than either gain or phase margin would indicate. The 
vector gain margin (VGM) is defined as the gain margin in the direction of the 
worst possible phase. For example, if the Nyquist plot comes closest to —1 on 
the negative real axis, then the vector margin is the same as the standard gain 
margin. From the geometry of the Nyquist plot, the distance from the curve to 
—l is 1 + DG = 4$ and with the definition that 


S = max |S], 


it follows that the distance of the closest point on the Nyquist curve from —1 is = 
If the Nyquist curve came this close to the —1 point on the real axis, it would pass 
through | — + and by definition, the product VGM x (1 — =) = |. Therefore 
we have that ^ 1 


S 
VGM = —39—, 7:33 
CES) (7.33) 


oo 


The VGM and related geometry are marked on the Nyquist plot in Fig. 7.23. 
We can express more complete frequency domain design specifications than 
any of these margins if we first give frequency descriptions for the external 
reference and disturbance signals. For example, we have described so far dynamic 
performance by the transient response to simple steps and ramps. A more realistic 
description of the actual complex input signals is to represent them as random 
processes with corresponding frequency spectra. A less sophisticated description 
which is adequate for our purposes is to assume that the signals can be represented 
as a sum of sinusoids with frequencies in a specified range. For example, we can 
usually describe the frequency content of the reference input as a sum of sinusoids 
with relative amplitudes given by a magnitude function |R] such as that plotted 
in Fig. 7.24, which represents a signal with sinusoidal components each having 
about the same amplitude of 150 up to some value w, and very small amplitudes 
for frequencies above that. With this assumption the response specification can 
be expressed by a statement such as “the magnitude of the system error is to be 
less than the bound e, (a value such as 0.01 that defines the required tracking 


Figure 7.23 
A Nyquist plot showing 


the vector gain margin 


Figure 7.24 

Sketch of typical 
specification of 
frequency content for 


reference input tracking 
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accuracy) for any sinusoid of frequency w, and of amplitude given by |R( jop” 
We can now define the size of the error in terms of the sensitivity function and the 
amplitude of the input. Using Eq. (7.32 ), the frequency-based error specification 


can be expressed as | E| = |5| |R| < e,. In order to normalize the problem without 
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€ Example 7.11 


defining both the spectrum R and the error bound each time, we define the real 
function of frequency W,(w) = |R| /e, and the requirement can be written as 


IS| W, < 1. (7.34) 


Performance Bound Function 


A unity feedback system is to have an error less than 0.005 for all unity amplitude sinusoids 
having frequency below 100 Hz. Draw the performance frequency function W, (c) for this 
design. 


Solution. The spectrum, from the problem description, is unity for 0 € w < 2007. Since 
e, = 0.005, the required function is given by a rectangle of amplitude 1/0.005 = 200 over the 
given range. The function is plotted in Fig. 7.25. 


Figure 7.25 

Plot of performance 
frequency function for 
Example 7.11 
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The expression in Eq. (7.34) can be translated to the more familiar Bode plot 
coordinates and given as a requirement on the open-loop gain, DG, by observing 
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that over the frequency range when errors are small the loop gain is large. In that 


case |S| ~ [DG] and the requirement is approximately 


G 
G= W. (1.35) 


Stability Robustness in Terms of the 
Sensitivity Function 


In addition to the requirements on dynamic performance, the designer is usually 
required to design for stability robustness. The models used for design are almost 
always only approximations to the real system. Many small effects are omitted, 
such as slight flexibility in structural members or parasitic electrical elements 
in an electronic circuit. Usually these effects influence the transfer function at 
frequencies above the control bandwidth and a nominal transfer function, Gx 
is used for the design. However, while the design is done for the nominal plant 
transfer function, the actual system is expected to be stable for an entire class 
of transfer functions that represent the range of changes that are expected to be 
faced as all elements are included and as changes due to temperature, age, and 
other environmental factors vary the plant dynamics from the nominal case. A 
realistic way to express plant uncertainty is to describe the plant transfer function 
as having a multiplicative uncertainty as 


G(jo) = G, Go) + w,(@)AGo)]. (7.36) 


In Eq. (7.36), G (jo) is the nominal plant transfer function, and the real function, 
w,(w), is a magnitude function that expresses the size of changes as a function 
of frequency that the transfer function is expected to experience and is known 
to be less than some upper bound W,(w). The value of the bound W, is almost 
always very small for low frequencies (we know the model very well there) and 
increases substantially as we go to high frequencies where parasitic parameters 
come into play and unmodeled structural flexibility is common. 


@ Example 7.12 


Model Uncertainty 


A magnetic memory read/write head assembly can be well modelled at low frequencies as 


G,(s) = T (7.37) 
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However, the arm supporting the read/write head has some lightly damped flexibility with 
uncertain resonant frequency. With scaling to place the resonant frequency at w,, and damping 
B, the more accurate model is represented as 


————————— (7.38) 


Compute the model uncertainty function for this case. 


Solution. The model transfer function given by Eq. (7.38) can be written as 


K 
G(s)-—|ti- 


na s \? 5 
2 es 
[0] @ 


o o 


(7.39) 
Comparing Eq. (7.39) with Eq. (7.36), the model uncertainty function is given by 
wo) = |- , (7.40) 


$C Ss 
(=) Le 
e, o, s=jw 


A plot of this function is given in Fig. 7.26 for «o, = l and B = .03. 


In general, a model uncertainty bound is small for low frequencies and large 
for higher frequencies. A typical shape is sketched in Fig. 7.27. The complex 
function, A (jo), represents the uncertainty in phase and is restricted only by the 
constraint 


|A(jw)| s I. (7.41) 


Stability robustness requires that we construct a control design for G (s) which 
will result in a stable system for any transfer function described by Eq. (7.36). 
To derive the requirement, we begin with the assumption that the nominal design 
has been done and is stable so that the Nyquist plot of DG, satisfies the Nyquist 
stability criterion. In this case, the equation 1 + D(jo)G, (jo) = 0 is never 
satisfied for any real frequency. If the system is to have stability robustness, 
the characteristic equation using the uncertain plant as described by Eq. (7.36) 
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complementary 
sensitivity function 


must not go to zero for any real frequency for any value of either W, or A. The 
requirement can be written as a function of jw in the form 


1+ DG #0, (7.42) 
1+ DG,[1 + w,^] #0, (7.43) 
(1+ DG (i eet -w a) #0, 
o 1+DG, ? 
(1+ DG,)(1+Tw,d) #0, (7.44) 


where the complementary sensitivity function is defined as 7 (jw) E DG,/ 
(1+ DG,) = 1— 5. Because the nominal system is stable, the first term in 
Eq. (7.42), (1 + D(jw)G,(j)), is not zero for any w. Thus, if Eq. (7.42) is not 
to be zero for any frequency, any w, < W,, or for any phase function A, then it 
is necessary and sufficient that 


|Tw,A| «e 
ITI |w,|IAI < 1, 
which reduces to 
Iz | Wo <1, (7.45) 


making use of Eq. (7.41) and the fact that w, is bounded by W,. As with the 
performance specification, for single-input-single-output unity feedback systems 
this requirement can be approximated by a more convenient form. Over the 
range of high frequencies where there is significant model uncertainty and W, is 
non-negligible, DG, is small. Therefore we can approximate 7 & DG, and the 
constraint becomes 


IDG |W, < 1 


|DG,| < di (7.46) 


w, 


€ Example 7.13 


Stability Robustness Function 


The uncertainty in the model of a disk read/write head assembly is given in Example 7.12. 
Suppose it is known that the parameter B is restricted to the range .03 < B < .3 and that the 
resonant frequency w, is known to be no less than 1.0. Plot the stability robustness bound, 
1/ W, for this problem. 


Solution. The function 1/w, (c) is plotted for B = .03 and w = | and w = 6 in Fig. 7.28 
using bode. It is clear that if the resonant frequency can take on any value greater than 1.0, 
then the bound 1/ W, needs to be extended at the value .03 for all frequencies greater than 1.0. 
The boundary line is marked with hatching in Fig. 7.28. 


* 
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Figure 7.28 104 F 
Plot of the stability gi 
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In practice, the magnitude of the loop gain is plotted on log-log coordinates, 
and the constraints of Eq. (7.35) and Eq. (7.46) are included on the same plot. A 
Figure 7.29 10? 


Typical design limitations 
as displayed on a Bode 
magnitude plot 


10! *.— Bode Plot 
A 


Magnitude 
S 
co 


107! 


10°? : : 
10 107! 109 10! 10? 10? 10* 
w (rad/sec) 


254 Chapter 7 


Bode's gain-phase 
formula 


Bode's gain-phase 
integral 
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typical sketch is drawn in Fig. 7.29. The designer is expected to construct a loop 
gain that will stay above W, for frequencies below w,, cross over the magnitude- 
of-1 line (log(| DG|) = 0) in the range w, < w < c, and stay below 1/ W, for 
frequencies above w,. We have developed the design constraints Eq. (7.35) and 
Eq. (7.46) in terms of jw as for continuous systems. The algebra and the equations 
are the same for the discrete case; one need only substitute the discrete transfer 
functions for the continuous ones and use the variable e^", 


Limitations on Design: Continuous Case 


One of the major contributions of Bode was to derive important limitations on 
transfer functions that set limits on achievable design specifications. For example, 
we would like to have the system error kept small for the widest possible range of 
frequencies and yet to have a system that is stable in the presence of uncertainty 
in the plant transfer function. In terms of the plot in Fig. 7.29, we want W, and W, 
to be large in their respective frequency ranges and for w, to be close to w,. Thus 
the loop gain is expected to plunge with a large negative slope from being greater 
than W, to being less than 1/ W, in a short span, while maintaining stability 
which can be expressed as having a good phase margin. Bode showed that this is 
impossible with a linear controller by showing that the minimum possible phase 
is determined by an integral depending on the slope of the magnitude curve. A 
common form of the formula for phase is 


1 f*? (dM u 
o(w,) = > Í (=) In(coth |= du, (7.47) 
3E JC 2 


o Ndu 


where M = In(| DC|) and u = /n(*), and thus P is the magnitude slope on the 
log-log (Bode) plot. The weighting function in Eq. (7.47) is concentrated near 
c, , and if the slope is constant for a substantial range around c,, then the formula 
can be approximated by 


u=0 
If, for example, the phase is to be kept above —150° to maintain a 30° phase 
margin, then the magnitude slope is estimated to be 


aM "4 > 150 m ) 
du m 180 
= — 1.667 


in the neighborhood of crossover. If we try to make the average slope steeper 
(more negative) than this near crossover, we will lose the phase margin. From 
this condition there developed the design rule: 


* Theasymptotes of the Bode plot magnitude, which are restricted to be integral 
values for rational functions, should be made to crossover the magnitude 1 


sensitivity integral 


constraint on 
non-minimum phase 
systems 
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line at a slope of —1 over a frequency range of about one decade around the 
cross-over frequency. 


Modifications to this rule need of course to be made in particular cases, but 
the limitation expressed by Eq. (7.47) is a hard limit that cannot be avoided. 

In Freudenberg and Looze (1985) an extension to another of Bode’s relations 
was derived. This is a constraint on the integral of the sensitivity function depen- 
dent on the presence of open-loop right-half plane poles. Suppose the loop gain 
DG, has n, poles, p,, in the right-half plane and “rolls off” at high frequencies 
at a slope faster than —1. For rational functions, this means that there is an excess 
of at least two more total poles than zeros. Then it can be shown that 


R 


f In(IS|)\dw =x Y * Re{p,}. (1.48) 
0 i=l 


If there are no right-half plane poles, then the integral is zero. This means that if we 
make the log of the sensitivity function very negative over some frequency band 
to reduce errors in that band, then of necessity In |S| will be positive over another 
part of the band and errors will be amplified there. If there are unstable poles, the 
situation is worse because the positive area where sensitivity magnifies the error 
must exceed the negative area where the error is reduced by the feedback. There 
are also consequences if DG, has any zeros in the right-half plane. If the open- 
loop system has no zeros in the right-half plane, then it is in principle possible to 
keep the magnitude of the sensitivity small by spreading the sensitivity increase 
over all positive frequencies to infinity but such a design requires an excessive 
bandwidth and is rarely practical. If a specific bandwidth is imposed, then the 
sensitivity function is constrained to take on a finite, possibly large, positive value 
at some point below the bandwidth and a large value of |S] „ leads to a small 
VGM and generally an unsatisfactory design. 

An alternative to Eq. (7.48) is also true if there is a (non-minimum-phase) 
zero of DG, in the right-half plane. Suppose the zero is located at z, = c, + jw, 
where c, > 0. Again we assume there are n, right-half plane poles at locations 
p, with conjugate values p;. Now the condition can be expressed as a two-sided 
weighted integral 


oo o ng 
In(|S|) ———*—-dw = l 
Í n(| Ir Cc w TE n 


eo o i= 


PEG gy 


i o 
In this case, we do not have the “roll-off” restriction and there is no possibility 
of spreading the positive area over all high frequencies because the weighting 
function goes to zero with frequency. The important point about this integral is 
that if the non-minimum phase zero is close to a right-half plane pole, the right 
side of the integral can be very large and the excess of positive area is required 
to be correspondingly large. Based on this result, one expects especially great 
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discrete sensitivity 
integral 


Figure 7.30 

Discrete Bode plot of 
1/s? plant with 
zero-order hold 


difficulty meeting specifications on sensitivity with a system having right-half 
plane poles and zeros close together. 


Limitations on Design: Discrete Case 


In the discrete case, the relation between gain slope and phase does not hold al- 
though it is approximately true for frequencies well below the Nyquist frequency. 
The situation is illustrated by the Bode plot in Fig. 7.30 for the plant G = 1/s?. 
Notice that the phase is always slightly more negative than the —180° one would 
get for this plant in the continuous case and deviates more as we approach the 
Nyquist limit at 2/7. The effect is approximated by the delay of 7/2 due to the 
sample and ZOH. From this example, one suspects that the restriction on phase 
due to gain slope is more severe in the discrete case than in the continuous case. 

In Sung and Hara (1988) the discrete versions of these design limitations 
are derived. We consider the single-loop unity feedback structure and define the 
sensitivity function as S = = as before. We also assume that the open-loop 
transfer function D(z)G(z) has n, poles outside the unit circle at z, = ret, 
r, > 1. It can be shown that 


f j In(|S(e/*?) Dao = x Y In(r,). (7.50) 
9 izl 


The implications of Eq. (7.50) are the same as in the continuous case except for 
the fact that the integral in Eq. (7.50) is over a finite limit. If we require that 
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sensitivity is small (negative log) over a given range of frequencies, there is only 
a finite frequency range over which we can spread the required sensitivity gain 
or “positive log” area. Again, unstable poles make the situation worse and the 
effect increases if the poles are located far from the unit circle. We can illustrate 
the implications of Eq. (7.50) by two simple examples. Consider a stable generic 
model of a chemical process and an unstable model of a magnetic levitation 
system. The two transfer functions are 


G(s) = chemical process, 


1 
(s 
l 
G,(s) = TEN magnetic levitation. 
ÁN -— 


In each case we will include a zero-order hold and sample with T — 0.2 and 
in each case the controller transfer function is D — 15 Le, selected to give 
approximately the same rise time and bandwidth. The step responses are shown 
in Fig. 7.31(a) and the sensitivity plots are shown in Fig. 7.31(b). Notice the 
substantially larger value of the sensitivity for the unstable plant compared to that 
of the stable one. The vector gain margin for C, is 5.4 while that for G, is 2.25, 
less than half as much. To plot the sensitivity magnitude, it is necessary to obtain 
a system having the transfer function of S. This can be done using the feedback 
function if a representation of the open loop system DG is given as, for example, 
the open loop discrete system sysold. The expression syscl = feedback(sys1,sys2) 
generates the loop with forward system sys! and feedback system sys2. For 
sensitivity, DG is the feedback system, and we need to construct the dummy 
gain of | for the forward system. This can be done with the statement sysf = 
Ss(0,0,0, 1, TS). Finally, the sensitivity is given by sens = feedback(sysf,sysold). 
The plot is given by the statements 


[mag,ph,w]=bode(sens); 
semilogy(w,mag) 
grid. 


The weighted integral of the discrete sensitivity function is similar to that of 
the continuous case. We assume again that the system has n, poles outside the 
unit circle at a, = Few, ie l, and conjugate @,, and also has a zero outside 
the unit circle at B, = r,e’*, r, > 1. Then 


= Rei w |i- 
)i In(|S (e/*) |) = —2-* m LU . (7.51) 
= r; —2r,cos($ — 6) + 1 Bic d, 
The main consequence of this constraint is that it expresses a limitation imposed 
by the non-minimum phase zero on the sensitivity function. The constraint is 
especially severe if there is a non-minimum phase zero near an unstable pole 


(B, © a). 


izl 
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Figure 7.31 
Comparisons of a stable 
process control with an 
unstable magnetic 
levitation: (a) step 
responses (b) sensitivity 
plots 
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Low Frequency Gains and Error Coefficients 


The steady-state error constants for polynomial inputs for discrete systems were 
established in Section 7.2 and are given by 


K = lim D(z)G(z) 
P z21 
for a Type 0 system, and by 


p -mE DO 


lim Tz (1.52) 


for a Type 1 system. In the frequency domain, for a Type 0 system, the procedure 
is identical to the continuous case. Since z = e/^^,2 > 1 implies that wT — 0, 
and the magnitude frequency-response plot will show a constant value on the low- 
frequency asymptote which is equal to K . For a Type | system, the procedure 
is again identical to the continuous case in that the magnitude of D(z)G(z) at 
w = | on the low-frequency asymptote is equal to K,. This can be seen from 
Eq. (7.52) if we note that for wT — 0, e^" = 1 + jwT. Therefore 


(z—1) 
m - 


z1 Tz jw—0 


thus establishing the fact that evaluation of the low-frequency asymptote of 
D(z)G(z) at w = 1 yields K,. This fact is most easily used if the frequency- 
response magnitude is plotted versus w in units of rad/sec so that w = 1 rad/sec 
is readily found. If the magnitude is plotted versus w in units of Hz or versus 
wT, one would need to perform a calculation to find the w = 1 rad/sec point. 
However, the error constants could be calculated directly with good software 
tools; therefore the issues in their calculation are of passing interest only. But no 
matter how the constants are found, the fact remains for discrete and continuous 
frequency response alike, the higher the magnitude curve at low frequency, the 
lower the steady-state errors. 


€ Example 7.14 


Finding Velocity Constant on a Bode Plot 


Use the discrete Bode plot to determine the K, for the antenna system of Example 7.6 with 
the compensation given by Eq. (7.19). 


Solution. The open-loop discrete transfer function is 


z + 0.9672 z — 0.80 
G(z) D(z) = (0.0484) ————— — — —6)— —. 
2D) (0480 rc - 09048) z = 0.05 
which yields the magnitude versus frequency in Fig. 7.32. Evaluation of the magnitude of the 
low-frequency asymptote at w = | indicates that K, = 1.26. Also note in the figure that the 
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Figure 7.32 
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extension of the low-frequency asymptote reaches crossover at w = 1.26, thus indicating also 
that K, = 1.26 since this Type ! system has a low-frequency slope of — 1. 


7.4.4 


* 


Compensator Design 


The amplitude and phase curves can be used to determine the stability margins 
based on the Nyquist stability criterion for either continuous or discrete systems. 
In the continuous case with minimum-phase transfer functions, Bode showed that 
the phase is uniquely determined by an integral of the slope of the magnitude 
curve on a log-log plot as expressed by Eq. ( 7.47). If the function is rational, these 
slopes are readily and adequately approximated by constants! As a consequence 
the amplitude curve must cross unity gain (zero on a log scale) at a slope of —1 
for a reasonable phase margin. The ability to predict stability from the amplitude 
curve alone in minimum phase systems is an important contributor to the ease 
with which designers can evaluate changes in compensator parameters in those 
cases. 

Bode's relationship between the amplitude and phase curve is lost for discrete 
systems because the variable z takes on values around the unit circle in contrast tos 
traversing the imaginary axis as in continuous systems. Figure 7.16 illustrates the 
degree to which the relationship is lost and indicates that the error would be small 
for frequencies lower than 1/20th of the sample frequency. However, it is typically 
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necessary to determine both magnitude and phase for discrete z-plane systems 
and not depend on magnitude alone for an accurate assessment of the stability. In 
carrying out direct digital designs, some intuition from continuous design can be 
used if the z-plane poles and zeros on the real axis are measured by their distance 
from +1. For example, the equivalent idea in the z-plane for the “breakpoint” in 
Bode's hand-plotting rules is that the magnitude will change slope at a frequency 
when cT, the angular position on the unit circle in radians, has the same value as 
the fractional distance of the singularity on the real axis to z = +1. For example, 
a pole at z = 0.9 will produce a slope change at wT = 0.1 rad. This equivalence 
is very accurate for low angular values (wT < 0.1 rad, i.e., sampling at more 
than 60 times the frequency) and is a reasonable approximation for angular 
values less than 0.8 rad (i.e., sampling at more than 8 times the frequency). 
In order to arrive at trial compensations with potential for better PM, GM, 
steady-state errors, or crossover frequency, it is useful to understand how a pole 
or zero placement will affect the magnitude and phase curves. Because of the 
equivalence ofthe break-point concept between the continuous and discrete cases, 
this is accomplished for discrete systems using the ideas from the continuous 
Bode hand-plotting techniques, keeping in mind that their fidelity degrades as 
frequency approaches the Nyquist frequency. It is easiest to select compensator 
break points if the frequency-response magnitude and phase is plotted versus 
wT so that the correspondence between those curves and the location of the 
compensation parameters is retained. 


€ Example 7.15 


Design of the Antenna Servo Control 


Design a discrete controller for the antenna system with T = 1 using the frequency response. 
The specifications are as before: overshoot less than 16%, settling time less than 10 sec and 
Ri l 

uc 


Solution. The system transfer function is 


z + 0.9672 
(z — 1)(z — 0.9048) ` 


The magnitude and phase of the uncompensated system [G(z)] shown in Fig. 7.35 indicate that 
with a compensator gain of K = 1 the system has a PM of 8° and a gain crossover frequency 
(w,,) of 0.3 rad/sec. The 16% overshoot requirement translates into £ > 0.5, which translates 
in turn into the requirement that the P M be > 50? from Fig. 7.24. The specification for settling 
time translates into the requirement that w, > 0.92. 

Because the gain of (z — 1)G(z) atz = 1 is 1, and T = 1, the compensated system will 
also have K, = 1 provided the DC gain of D(z) = 1. In terms of the frequency response, 
this means that the extension of the low-frequency-magnitude asymptote should pass through 
the value 1 at w = | for the uncompensated case (in Fig. 7.33), and the gain of this low- 
frequency asymptote should not be decreased with any candidate compensation. To maintain 
an acceptable K,, we will evaluate only D(z)'s with a DC gain of 1. The uncompensated 


G(z) = 0.0484 (7.53) 
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system's PM of 8? indicates poor damping, and the W, of 0.3 rad/sec indicates that it will be 
too slow. Just as for continuous systems, c, occurs approximately at the system bandwidth 
and dominant natural frequency; therefore, we should try to change the design so that it has 
a w,, of about 0.9 rad/ sec in order to meet the t, < 10 sec. Once we find a compensation 
that meets the guidelines of PM = 50? and W, = 0.9 rad/sec, we will need to check whether 
the settling time and overshoot specifications are actually met, because the design guidelines 
followed are only approximate. 

Figure 7.33 shows several attempts to produce a design. The breakpoint of the first attempt 
(D, (z) in Fig. 7.33) was at 0.15 rad/sec 4 and did not increase the slope of the magnitude curve 
ata low enough frequency to bring about the desired w,,. This was remedied in D, (z), where 
the breakpoint was lowered to 0.1 rad/sec (zero at z — 0.9) causing a Weg of 0.9 rad/sec, but 
the resulting PM of 40° was still lower than desired. By moving the compensator pole out to 
z = —0.5 in D,(z), we had very little effect on the w,, but achieved an increase in the PM 
to 50°. Because both goals are met, D,(z) has a reasonable chance to meet the specifications; 
in fact, the calculation of a time history of the system response to a step input shows that the 
t, is 7 sec, but, alas, the overshoot is 27%. The guidelines were not successful in meeting the 
specifications because the system is third order with a zero, whereas the rules were derived 
assuming a second-order system without a zero. 

The necessary revisions to our design guidelines are clear; we want more than a 50° PM 
and do not require a 0.9 rad/sec NE Figure 7.34 shows the system frequency response using 
D,(z) along with two revisions of D(z) that satisfy our revised goals. D,(z) has a 60° PM and 
a 0.6 rad/sec Weg? and D, (z) has a 58° PM and a 0.8 rad/sec Weg The time history of the system 


4 The zero at z = 0.85 translates into a 0.15 rad/sec breakpoint only because the sample period, 7, is 1 
sec. For T = 0.1 sec, a zero at z = 0.85 would translate into a 1.5 rad/sec breakpoint, etc. 
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Figure 7.34 
Bode plots of designs 
with controllers D}, D,, 
and D, for Example 7.15 $ 
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Figure 7.35 
Step response of the 
system with controller D. 


Output, y, and control, 4/10 


0 2 4 6 8 10 12 14 16 18 20 


Time (sec) 


response to a step using D,(z) in Fig. 7.35 shows that it exactly meets the requirements for 
16% overshoot and t, = 10 sec. Furthermore, the design of the system imposed the constraint 
that K, = I and the design is complete. 
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Figure 7.36 
Sensitivity plots of 


designs with controllers 


D,, D,, and D, for 
Example 7.15 
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Itis interesting to look at the sensitivity functions for these designs, plotted in Fig. 7.36 as 
log of the sensitivity versus a linear frequency scale to illustrate the balance between positive 
and negative areas for such plots for stable systems. @n this plot, one can see that controller 
D, results in the highest bandwidth and also the highest maximum of the sensitivity or lowest 
vector gain margin. Controller D, has improved robustness (lower maximum of the sensitivity) 
but also lower bandwidth. Finally, the design given by D, splits the difference and meets each 
specification. 


7.5 


* 


Direct Design Method of Ragazzini 


Much of the style of the transform design techniques we have been discussing 
in this chapter grew out of the limitations of technology that was available 
for realization of continuous-time compensators with pneumatic components or 
electric networks and amplifiers. In particular, many constraints were imposed in 
order to assure the realization of electric compensator networks D(s) consisting 
only of resistors and capacitors. With controllers realized by digital computer, 
such limitations on realization are, of course, not relevant and one can ignore these 
particular constraints. An alternative design method that ignores constraints of 


5 In the book by Truxal (1955), where much of this theory is collected at about the height of its first stage 
of development, a chapter is devoted to RC network synthesis. 
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technology has been found to be useful in adaptive controls. Suppose we are given 
a discrete transfer function G(z) of the plant (plus hold) and a desired transfer 
function H(z) between reference R and output Y. The structure is assumed to be 
aunity feedback system and the design is to select the computer transfer function 
D(z) to realize H(z). The overall transfer function is given by the formula 


A = ——.,, 
y= De 


from which we get the direct design formula 
BS Ores (69 
G(z) | — H(z) 


From Eq. (7.54) we can see that this design calls for a D(z) that will cancel 
the plant effects and add whatever is necessary to give the desired result. The 
problem is to discover and implement constraints on H(z) so that we do not ask 
for the impossible. 

First, the design must be causal. From z-transform theory we know that if 
D(z) is to be causal, then as z — oo its transfer function is bounded and does not 
have a pole at infinity. Looking at Eq. (7.54 ), we see that if G(z) were to have 
a zero at infinity, then D(z) would have a pole there unless we require an H (z) 
that cancels it. Thus we have the constraint that for D(z) to be causal 


D(z) (7.54) 


H (z) must have a zero at infinity of the same order 
as the zero of G(z) at infinity. (2:55) 


This requirement has an elementary interpretation in the time domain: A zero 
of order k at infinity in G(z) corresponds to a delay of k samples in the pulse 
response of the plant. The causality requirement on H (z) is that the closed-loop 
system must have at least as long a delay as the plant. 

Considerations of stability add a second constraint. The roots of the charac- 
teristic equation of the closed-loop system are the roots of the equation 


1+ D(z)G(z) = 0. (7.56) 


We can express Eq. (7.56) as a polynomial if D and G are rational and we identify 
D = c(z)/d(z) and G = b(z)/a(z) where a, b, c, and d are polynomials. Then 
the characteristic polynomial is 


ad + bc = 0. (7.57) 


Now suppose there is a common factor in DG, as would result if D(z) were 
called upon to cancel a pole or zero of G (z). Let this factor be z — œ and suppose 
it is a pole of G(z), so we can write a(z) = (z — o)a(z), and to cancel it we have 
c(z) = (z — o)c(z). Then Eq. (7.57) becomes 


(z — a)a(z)d(z) + b(z)(z — a)c(z) 


(Galad abe] i $05) 


0 
0. 
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@ Example 7.16 


In other words— perhaps it was obvious from the start—a common factor remains 
a factor of the characteristic polynomial. If this factor is outside the unit circle, 
the system is unstable! How do we avoid such cancellation? Considering again 
Eq. (7.54), we see that if D(z) is not to cancel a pole of G(z), then that factor of 
a(z) must also be a factor of 1 — H(z). Likewise, if D(z) is not to cancel a zero 
of G (z), such zeros must be factors of H(z). Thus we write the constraints? 


1 — A(z) must contain as zeros all the poles of (7.59) 
G (z) that are outside the unit circle, d 
H (z) must contain as zeros all the zeros of 


G(z) that are outside the unit circle. (7.60) 


Consider finally the constraint of steady-state accuracy. Because H(z) is the 
overall transfer function, the error transform is given by 
E(z) = R(z)(1 — H(2). (7.61) 


Thus if the system is to be Type 1 with velocity constant K,, we must have zero 
steady-state error to a step ane 1/ K, error to a unit ramp. The first requirement is 


e(oo) = lim(z — Di — H(z)] ^ 0, (7.62) 
which implies 
H(1) 2 1. (7.63) 
The velocity constant requirement is that 
e(oo) = lim(z — yet — H(z) = ES (7.64) 
zl (a=) K 


From Eq. (7.63) we know that | — H(z) is zero at z = 1, so that to evaluate the 
limit in Eq. (7.64), it is necessary to use L’H6pital’s rule with the result 
dH l 


cabra =—, 1B 
dz zzl K, = 


Design by the Direct Method 


Consider again the plant described by the transfer function of Eq. (7.53) and suppose we ask 
for a digital design that has the characteristic equation that is the discrete equivalent of the 
continuous characteristic equation 


s +st+1=0, 


with a sampling period T = | sec. 


6 Roots on the unit circle are also unstable by some definitions, and good practice indicates that we 
should not cancel singularities outside the radius of desired settling time. 
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Solution. The discrete characteristic equation according to the specifications is 
z? — 0.7859z + 0.36788 = 0. (7.66) 


Let us therefore ask for a design that is stable, has K, — 1, and has poles at the roots of 
Eq. (7.66) plus, if necessary, additional poles at z — 0, where the transient is as short as 
possible. The form of H (z) is thus 


ooyrkbi Absa? big us 


HG) = oas 0367977 on 
The causality design constraint, using Eq. (7.55) requires that 
H(z) co = 0 
or 
b, =0. (7.68) 


Equations (7.59) and (7.60) add no constraints because G(z) has all poles and zeros inside the 
unit circle except for the single zero at oo, which is taken care of by Eq. (7.68). The steady-state 
error requirement leads to 


H(1)21 
b, +b, cb dee 

1 — 0.7859 + 0.3679 (7:69) 

Therefore 
b, +b, +b, +--+ = 0.5820, 

and 

o ie 

dz ha K, 


Because in this case both 7 and K, are 1, we use Eq. (7.69) and the derivative with respect to 
ll . 
Z^ toobtain 


TEN A 
z K, n di z=) 
_ (0.5820) [b, + 2b, +36, +--+] — [0.5820][—0.7859 + 0.3679(2)] 


(0.5820) (0.5820) 
or 
b, + 2b, + 3b, + --- —[—0.05014] _ i 
0.5820 SE 
Because we have only two equations to satisfy, we need only two unknowns and we can 
truncate H(z) at b,. The resulting equations are 
b, + b, = 0.5820, 
b, + 2b, = 0.5318, 


(7.70) 


which have the solution 


b, — 0.6321, (7.71) 
b, = —0.05014. (7.72) 
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Thus the final design gives an overall transfer function 


0.6321z — 0.05014 


Hg. ee MUN. 
(2) = 73 97859: + 0.3679 


(7.73) 


We shall also need 
(z — 1)(z — 0.4180) 
z? — 0.7859z + 0.3679 


We know that H (1) = 1 so that | — H(z) must have a zero at z = 1. Now, turning to the basic 
design formula, Eq. (7.54), we compute 


1 — H(z) = (7.74) 


(z —1)(z — 0.9048)(0.6321)  (z — 0.07932) 
(0.04837)(z + 0.9672) (z — 1)(z — 0.4180) 

(z — 0.9048) (z — 0.07932) 

(z + 0.9672) (z — 0.4180) ` 


D(z) = 


= 13.07 


A plot of the step response of the resulting design for this example is provided 
in Fig. 7.37 and verifies that the response samples behave as specified by H(z). 
However, as can be seen also from the figure, large oscillations occur in the 


Figure 7.37 

Step response of 
antenna system from 
direct design 


Output, y, and control, u/20 


Time (sec) 


7.6 
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control that cause a large ripple in the system response between samples. How 
can this be for a system response transfer function, 


Y(z) DG 

RG) = H(z) = TDG: 
that is designed to have only two well-damped roots? The answer lies in the fact 
that the control response is determined from 

U() . D H) 

Wü) 1+DG Gi) 


which for this example is 


U(z) z — 0.0793 (z — 1)(z — 0.9048) 
Los cl 1306 — 
R(z) z^ — 0.78592 + 0.3679 z + 0.9672 
The pole at z = —0.9672, very near to the unit circle, is the source of the oscil- 


lation in the control response. The poor transient due to the pole did not show 
up in the output response because it was exactly canceled by a zero in the plant 
transfer function. The large control oscillation in turn causes the ripple in the 
output response. This pole was brought about because we allowed the controller 
to have a pole to cancel a plant zero at this position. The poor response that 
resulted could have been avoided if this nearly unstable zero had been included 
in the stability constraint list. In that case we would introduce another term in 
H(z), bare, and require that H(z) be zero at z = —0.9672, so this zero of G(z) 
is not canceled by D(z). The result will be a simpler D(z) with a slightly more 
complicated H(z). 


Summary 


In this chapter we have reviewed the philosophy and specifications of the design 
of control systems by transform techniques and discussed three such methods. 


Discrete controllers can be designed by emulation, root locus, or frequency 
response methods. 

Successful design by emulation typically requires a sampling frequency at 
least 30 times the expected closed-loop bandwidth. 

Expressions for steady-state error constants for discrete systems have been 
given in Eq. (7.12) and Eq. (7.14) in terms of open-loop transfer functions 
and in Eq. (7.18) in terms of closed-loop poles and zeros. 

Root locus rules for discrete system characteristic equations are shown to be 
the same as the rules for continuous system characteristic equations. 

Step response characteristics such as rise time and overshoot can be corre- 
lated with regions of acceptable pole locations in the z-plane as sketched in 
Fig. 7.10. 
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Asymptotes as used in continuous system frequency response plots do not 
apply for discrete frequency response. 

Nyquist's stability criterion and gain and phase margins were developed for 
discrete systems. 

The sensitivity function was shown to be useful to develop specifications on 
performance robustness as expressed in Eq. (7.34). 

Stability robustness in terms of the overall transfer function, the complemen- 
tary sensitivity function, is expressed in Eq. (7.47). 

Limitations on the frequency response of closed-loop discrete designs are 
made more severe by poles and zeros outside the unit circle as expressed by 
Eq. (7.50) and Eq. (7.51). 

Lead and lag compensation can be used to improve the steady-state and 
transient response of discrete systems. 

The direct design method of Ragazzini can be used to realize a closed-loop 
transfer function limited only by causality and stability constraints. 


7.7 Problems 


TA 


7.2 


7.3 


Use the z = e^ mapping function and prove that the curve of constant ¢ in s is a 
logarithmic spiral in z. 

A servomechanism system is expected to have a rise-time of no more than 10 milliseconds 
and an overshoot of no more than 596. 


(a) Plot in the s-plane the corresponding region of acceptable closed-loop pole 
locations. 


(b) What is the estimated Bode gain crossover frequency (rad/sec)? 
(c) What is the estimated phase margin in degrees? 


(d) What is the sample period, 7, if the estimated phase shift due to the sample and 
hold is to be no more than 10? at the gain crossover? 


(e) What is the sample period 7 if there are to be 8 samples per rise time? 


Root locus review. The following root loci illustrate important features of the root locus 
technique. All are capable of being sketched by hand, and it is recommended that they be 
done that way in order to develop skills in verifying a computer's output. Once sketched 
roughly by hand, it is useful to fill in the details with a computer. 


(a) The locus for 


1+K ud =0 
sės + p) 
is typical of the behavior near s — 0 of a double integrator with lead compensation 
or a single integration with a lag network and one additional real pole. Sketch the 
locus versus K for values of Pi of 5, 9, and 20. Pay close attention to the real axis 
break-in and break-away points. 


7.4 


7.6 
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(b) The locus for 


(c) 


(d) 


K l 7 =0 
s(s + 1)(G +a) + 4) 


is a typical locus that includes complex poles and shows the value of departure 
angles. Plot the locus for a = 0, 1, and 2. Be sure to note the departure angles from 
the complex poles in each case. 


The locus for 


iP Le 
page Eon q 
s(s° + 4) 


illustrates the use of complex zeros to compensate for the presence of complex 

poles due to structural flexibility. Be sure to estimate the angles of departure and 
arrival. Sketch the loci for w, = 1 and w, = 3 . Which case is unconditionally 

stable (stable for all positive K less than the design value)? 


For 


M 


kK ——————_ = 0 
(s — p,)(s — pj) 


l4 


show that the locus is a circle of radius ,/p, p, centered at the origin (location of 
the zero). Can this result be translated to the case of two poles and a zero on the 


negative real axis? 


4 ; 3 ; : 1 
The basic transfer function of a satellite attitude control is G(s) = —. 
so 


(a) 


(b) 


(c) 


Design a continuous lead network compensation so as to give closed-loop poles 
corresponding to ¢ = 0.5 and natural frequency w, = 1.0. The ratio of pole to zero 
of the lead is to be no more than 10. 


i. Plot the step response of the design and note the rise time and the percent 
overshoot. 


ii. What is the system type and corresponding error constant? 


Select a sampling period to give 10 samples in a rise time and compute the discrete 
equivalent to the lead using the Tustin bilinear transformation. Plot the step response 
of the discrete system and compare the rise time and overshoot to those of the 
continuous design. 


Select a sampling period that will give 5 samples per rise time, compute the discrete 
equivalent using Tustin's method, and compare rise time and overshoot of this 
design with the continuous case. 


Repeat the design for the satellite attitude control of Problem 4, including method of 
choosing sampling periods but using the matched pole-zero method to obtain the discrete 
compensations. 


Repeat the design of the satellite attitude control of Problem 4 including method of choice 
of sampling period but using the triangle hold equivalent (noncausal first-order hold) to 
design the discrete compensations. 
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7.7 


7.8 


7.9 


7.10 


7.11 


Repeat the design for the satellite attitude control of Problem 4 but augment the plant 
with a Pade approximation to the delay of 7/2 which is to say, multiply the plant transfer 
function by 


tn 


P(s) = 4 
(s) Tu 


before doing the continuous design. Once the design of the lead compensation is com- 
pleted, continue with the discrete equivalents as in Problem 4, including the method of 
choosing sampling periods. Use the matched pole-zero method to obtain the discrete 
compensations. Compare the design with the continuous case. 


Design a discrete compensation for the antenna control system as specified in Example 
7.2 with a sample period of T = 0.1 using a matched pole-zero equivalent for the discrete 
compensation. Plot the step response and compare rise time and overshoot with those of 
the continuous design. 


Design the antenna control system as specified in Example 7.2 with a sample period of 
T = 0.5 sec. 


(a) Use the zero-pole mapping equivalent emulation method. 


(b) Augmentthe plant model with an approximation of the sample-hold delay consisting 
of 


then redesign D(s) and find the discrete equivalent with the matched pole-zero 
equivalent emulation method. Plot the step response and compare with the 
continuous design done on the unaugmented plant. 


(c) Compare the degradation of the equivalent damping ratio £ due to sampling for 
both design methods. 


For the satellite with transfer function 1/s?, design a lead compensation to give closed- 
loop poles with damping c = 0.5 and natural frequency w, = 1.0. The pole-to-zero ratio 
of the compensation should not be more than 10. Plot the step response of the closed 
loop and note the rise time and overshoot. 

(a) Let sampling period be Ts = 0.5 sec and compute the discrete model of the 
plant with a sample and zero-order hold. Using this model, design a discrete lead 
compensation with pole at z = —0.5 and a zero so as to give closed loop poles at 
the mapped place from the continuous poles, ¢ = 0.5 and w, = 1.0. What is the 
ratio of w,/w, for this problem? How many samples do you expect to find per rise 
time? Plot the step response and compare result with expectation. Compare the 
discrete design with the continuous design. 

(b) Repeat the discrete design with sampling period Ts = 1.2 sec, plot the step 
response, and compare rise time and overshoot with the continuous case. 


Sketch the region in the z-plane of discrete pole locations corresponding to ¢ > 0.5 and 
(a) w, < œw /30. 
(b) w, < «,/10. 


(c) w, < w,/5. 


Figure 7.38 
A Steel ball balanced by 
an electromagnet 
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7.12 The plant transfer function 


7.13 


l 
(s +0.1)(s + 3) 
is to be controlled with a digital controller using a sample period of T = 0.1 sec. 


G(s) = 


(a) Design compensation using the z-plane root locus that will respond to a step with a 
rise time of < 1 sec and an overshoot < 5%. Plot the step response and verify that 
the response meets the specifications. 


(b) What is the system type and corresponding error constant? What can be done to 
reduce the steady-state error to a step input? 

(c) Design a discrete lag compensation that will cut the steady-state error in half. 
Plot the step response and compare the complete response to the transient and 
steady-state error specifications. 


It is possible to suspend a steel ball bearing by means of an electromagnet whose current 
is controlled by the position of the mass [Woodson and Melcher (1968)]. A schematic of 
a possible setup is shown in Fig. 7.38. The equations of motion are 


m X= —mg + f(X, D, 


where the force on the ball due to the electromagnet is given by f(X, I). It is found that 
the magnet force balances the gravity force when the magnet current is I, and the ball at 
X,. If we write I = 1, + / and X = X, + x and expand f about X — X, and l = I, and 
then neglect higher-order terms, we obtain a linear approximation 


mk = kx + ki. 
Values measured for a particular device in the Stanford Controls Laboratory are m = 0.02 
kg, k, = 20 N/m, k, = 0.4 N/A. 
(a) Compute the transfer function from į to x and draw the (continuous) root locus for 
proportional feedback i = — Kx. 
(b) Let the sample period be 7 = 0.02 sec and compute the plant discrete transfer 
function when used with a sample and zero-order hold. 


(c) Design a digital control for the magnetic levitation to meet the specifications 
t, < 0.1 sec, ¢, < 0.4 sec, and overshoot < 20%. 
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Figure 7.39 
An automotive 
cruise-control system 


(d) 


(e) 


Plot a root locus of your design versus m and discuss the possibility of balancing 
balls of various masses. 


Plot a step response of your design to an initial disturbance displacement on the ball 
and show both x and the control current ?. If the sensor can measure x over a range 
of only +i cm, and if the amplifier can provide a maximum current of 1 A, what is 
the maximum initial displacement, x (0) „a, that will keep the variables within these 
limits, using m = 0.02 kg? 


7.14 A discrete transfer function for approximate derivative control is 


-1 
Tz ' 


DG) - K,T 5 


where the pole at z — 0 adds some destabilizing phase lag. It therefore seems that it 
would be advantageous to remove it and to use derivative control of the form 


DO = K,T, ==. 


Can this be done? Support your answer with the difference equation that would be 
required and discuss the requirements to implement it. 


7.15 For the automotive cruise-control system shown in Fig. 7.39, the sample period is T = 
0.5 sec. 


(a) 
(b) 
(c) 


Design a PD controller to achieve a t, of 5 sec with no overshoot. 
Determine the speed error on a 396 grade (i.e., G, = 3 in Fig. 7.39). 


Design a PID controller to meet the same specifications as in part (a) and that has 
zero steady-state error on constant grades. What is the velocity constant of your 
design? 


7.16 For the disk drive read/write head assembly described in Fig. 7.40, you are to design 
a compensation that will result in a closed-loop settling time £, = 20 msec and with 
overshoot to a step input M, < 20%. 


(a) 


(b) 


Assume no sampling and use a continuous compensation. Plot the step response 
and verify that your design meets the specifications. 


Assume a sampling period 7 = | msec and use matched pole-zero emulation. If 
you wish, you can include a Pade approximation to the delay and do a redesign 
of the continuous compensation before computing the discrete equivalent. Plot the 
step response and compare it with the continuous design’s response. 


Fuel Grade, G}, Vehicle 
Throttle dynamics dynamics 
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Figure 7.40 p Electronic Head 
A disk drive read/write lag dynamics 


head assembly 


Sensor 


(c) Doa z-plane design for the same specifications and plot its step response. Compare 
these three designs with respect to meeting the transient performance. 


7.17 The tethered satellite system shown in Fig. 7.41 has a moveable tether attachment point 
so that torques can be produced for attitude control. The block diagram of the system is 
shown in Fig. 7.42. Note that the integrator in the actuator block indicates that a constant- 
voltage command to the servomotor will produce a constant velocity of the attachment 
point. 


(a) Isit possible to stabilize this system with 0 feedback to a PID controller? Support 
your answer with a root locus argument. 

(b) Suppose it is possble to measure ó, and as well as 6. Select the variable(s) on 
which you would like to place a sensor(s) to augment the 0 feedback. 


(c) Design compensation for the system using the sensor(s) that you selected in part 
(b) so that it has a 2-sec rise time and equivalent closed loop damping of ¢ = 0.5. 


Figure 7.41 
A tethered satellite 
system 
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Figure 7.42 
Block diagram for the 
tethered satellite system 


el Sensor(s) =- ? 


ô = displacement of tether attachment point 
T, = tether tension 
M = moment on satellite 

= satellite attitude 


Stick 
Figure 7.43 Angle sensor 
An excavator with an Actuator 
automatic contro! system | 


7.18 The excavator shown in Fig. 7.43 has a sensor measuring the angle of the stick as part of 
a control system to control automatically the motion of the bucket through the earth. The 
sensed stick angle is to be used to determine the control signal to the hydraulic actuator 
moving the stick. The schematic diagram for this control system is shown in Fig 7.44, 
where G(s) is the system transfer function given by 


* 1000 

s(s + 10) (s? + 1.25 4-144) 
The compensation is implemented in a control computer sampling at f, = 50 Hz and is 
of the form D(z) = K(1 + K,(1 —z7') + K,(1 —2z ^! +z). The oscillatory roots 
in G(s) arise from the compressibility of the hydraulic fluid (with some entrained air) 
and is often referred to as the oil—miass resonance. 


G(s) 


Figure 7.44 

Schematic diagram for 
the control system of the 
excavator 


(a) 


(b) 


(c) 


(d) 
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T = 0.02 sec 


Show that the steady-state error. € (= 0, — 0), is 


isis 1.44 
sx 

when 6, is a unit ramp. 
Determine the highest K possible (i.e., at the stability boundary) for proportional 
control (K= K, = 0). 
Determine the highest K possible (i.e., at the stability boundary) for PD (K, = 0) 
control. 
Determine the highest K possible (i.e., at the stability boundary) for PD plus 
acceleration (K, + 0) control.’ 


7.19 For the excavator described in Problem 7.18 with transfer function 


1000 


G(s) = ——_—___, 
s(s + 10)(s? + 1.25 + 144) 


plot the Bode plot and measure the gain and phase margins with just unity feedback. 


(a) 


(b) 


Design a compensation that will give a phase margin of 50°. a gain margin as 
measured at the resonance peak of at least 2 and a crossover of at least to 2 1.0. 
Plot the step response of the resulting design and note the rise time and the 
overshoot. 


With sample frequency f, = 50Hz, design a discrete controller to meet the same 
specifications given for the continuous case. Plot the Bode plot of the design and 
verify that the specifications are met. 


7.20 A simple model of a satellite attitude control has the transfer function 1/s?. 


(a) 


(b) 


(c) 


Plot the Bode plot for this system and design a lead compensation to have a phase 
margin of 50? and a crossover o. of 1.0 rad/sec. Plot the step response and note 
the rise time and overshoot. 


With sample period of 7 — 0.2 sec, design a discrete compensation to give 50? 
phase margin and crossover «,, = 1.0. Plot the step response and compare rise 
time and overshoot with the continuous design. 


With sample period T = 0.5 sec, design a discrete compensation to give 50? phase 
margin and crossover c, = 1.0. Plot the step response and compare rise time and 
overshoot with the continuous design. 


7.21 Fora system given by 


DM. 
~ s(s +a) 


G(s) 


7 For further reading on damping the oil-mass resonance of hydraulic systems, see Viersma (1980). 
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7.22 


7.23 


determine the conditions under which the K, of the continuous system is approximately 
equal to the K, of the system preceded by a ZOH and represented by its discrete transfer 
function. 
Design a digital controller for 

jl 
s(s + 0.4) 
preceded by a ZOH so that the response has a rise time of approximately 0.5 sec, overshoot 
« 25%, and zero steady-state error to a step command. [Hint: Cancel the plant pole at 
s = 0.4 with a compensator zero; a second-order closed-loop system will result, making 
the transient response comparison between experiment and theory much easier.] 


G(s) = 


(a) Determine a D(z) using emulation with the matched pole-zero mapping technique. 
Do two designs, one for 7 = 100 msec and one for 7 = 250 msec. 

(b) Repeat part (a) using the z-plane root locus method for the two sample periods. 

(c) Simulate the closed-loop system response to a unit step with the D(z)’s obtained 
in parts (a) and (b). Use the discrete equivalent of the plant in your calculations. 
Compare the four digitally controlled responses with the original specifications. 
Explain any differences that you find. 

A generic mechanical control problem is that of two masses coupled by a lightly damped 

flexible structure. With amplitude and time scaling, the model can be reduced to 
Bs+1 

s (s? + Bs +1)’ 

For the parameters K = 0.05 and B = 0.1, plot the Bode plot and indicate the gain 

and phase margins. For performance tracking, the open loop gain for frequencies below 


w = 0.005 must be above 50 and the gain for frequencies above w = | must be below 
0.5. The phase margin must be at least 35°. 


G(s) = K 


(a) Design a continuous compensation to meet the specifications. If possible, keep the 
phase above — 180° so the system will not be conditionally stable. 

(b) Design a discrete compensation for the system with the lowest sampling frequency 
possible. A prize will be given to the student who obtains the lowest sampling 
frequency with a design that meets the specifications. 


(c) Plot the sensitivity of the digital design and compute the vector gain margin. 


-S- 


Design Using State-Space Methods 


A Perspective on State-Space Design Methods 


In Chapter 7, we discussed how to design digital controllers using transform 
techniques, methods now commonly designated as “classical design." The goal 
of this chapter is to solve the identical problem using the state-space formulation. 
The difference in the two approaches is entirely in the design method; the end 
result, a set of difference equations providing control, is identical. 

Advantages of the state-space formulation are especially apparent when de- 
signing controllers for multi-input, multi-output (MIMO) systems, that is, those 
with more than one control input and/or sensed output. However, state-space 
methods are also an aid in the design of controllers for single-input, single-output 
(SISO) systems because of the widespread use of computer-aided control system 
design (CACSD) tools, which often rely heavily on this system representation. 
Chapters 4 and 5 have already demonstrated the advantages of the state-space for- 
mulation in using CACSD packages for the computation of discrete equivalents. 
State-space methods also offer advantages in the structure for command inputs 
and disturbance estimation. In this chapter, we will limit our state-space design 
efforts to SISO controllers, similar to those found in Chapter 7 with classical 
methods. Techniques for MIMO design are discussed in Chapter 9. 

In Chapter 7, two basic methods were described: emulation and direct digital 
design. The same two methods apply to the state-space formulation as well. Using 
emulation, one would design a continuous controller using state-space methods, 
then transform the controller to a discrete form by using one of the discrete 
equivalents from Chapter 6. The discussion of the method and its accuracy in 
Chapter 7 applies equally well here. Furthermore, the development in Chapter 6 
used both classical and state-space system descriptions in the computation of the 
equivalents. Therefore, no further discussion of emulation is required, and we 
will concentrate solely on the direct digital design method. 
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8.1 


Chapter Overview 


As for the continuous case reviewed in Section 2.6, design using state-space 
involves separate design of the control assuming all state elements are available, 
then design of an estimator to reconstruct the state given a partial measurement 
of it. Section 8.1 covers the discrete control design while Section 8.2 covers 
the discrete estimator design. Section 8.3 puts it together into what is called the 
regulator and Section 8.4 discusses the relative merits of the various ways of 
introducing the reference input command. Section 8.5 presents how a designer 
builds an integral control feature or disturbance estimation, and how they accom- 
plish similar goals. Section 8.6 discusses the limitations imposed by delays in the 
system and how to minimize their effect. The chapter concludes in Section 8.7 
with a discussion of observability and controllability, the required conditions for 
the design to be possible. 


Control Law Design 


In Chapter 2, we saw that the state-space description of a continuous system is 
given by Eq. (2.1) 


x = Fx + Gu, (8.1) 
and Eq. (2.2) 
y = Hx + Ju. (8.2) 


We assume the control is applied from the computer through a ZOH as shown in 
Fig. 1.1. Therefore, Eqs. (8.1, and (8.2) have an exact discrete representation as 
given by Eq. (4.59) 


x(k+1) = ®x(k) +PFu(k), (8.3) 
y(k) = Hx(k)+ Ju(&), . 
where 
o = pb 
r = fl ednG. eS 


We can easily transform between the classical transfer function of a continuous 
system, G(s) (represented by sysTF in MATLAB), to the state-space continuous 
description by the MATLAB script! 


SysC = ss(sysTF) 


1 In MATLAB, G(s) would be represented by [num, den] and the tf2ss function would be invoked. See 
Appendix F. 


estimator 
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where sysC contains F, G, H, J. Provided the continuous system is preceded by 
a ZOH, we can transform to the discrete model, $, I, H, J, (or sysD) with a 
sample period T, using the MATLAB script 


sysD = c2d(sysC,T). 


One of the attractive features of state-space design methods is that the pro- 
cedure consists of two independent steps. The first step assumes that we have all 
the state elements at our disposal for feedback purposes. In general, of course, 
this would not be a good assumption; a practical engineer would not, as a rule, 
find it necessary to purchase such a large number of sensors, especially because 
he or she knows that they would not be needed using classical design methods. 
The assumption that all states are available merely allows us to proceed with 
the first design step, namely, the control law. The remaining step is to design 
an “estimator” (or "observer?), which estimates the entire state vector, given 
measurements of the portion of the state provided by Eq. (8.2). The final control 
algorithm will consist of a combination of the control law and the estimator with 
the control-law calculations based on the estimated states rather than on the actual 
states. In Section 8.3 we show that this substitution is reasonable and that the 
combined control law and estimator can give closed-loop dynamic characteristics 
that are unchanged from those assumed in designing the control law and estimator 
separately. The dynamic system we obtain from the combined control law and 
estimator is the same that has been previously referred to as compensation. 

As for the continuous case, the control law is simply the feedback of a linear 
combination of all the state elements, that is 


u = -Kx = -| K K, 


1 2 


(8.5) 


rm 
PEE. 
[CR 


Note that this structure does not allow for a reference input to the system. The 
topology that we used all through Chapter 7 (Fig. 7.5) always included a reference 
input, r. The control law, Eq. (8.5), assumes that r = 0 and is, therefore, usually 
referred to as a regulator. Section 8.4 will discuss how one introduces reference 
inputs. 

Substituting Eq. (8.5) in Eq. (8.3), we have 


x(k + 1) = x(k) — 'Kx(4). (8.6) 
Therefore the z-transform of Eq. (8.6) is 
(z I — P + PK)X(z) = 0, 


2 The literature [Luenberger (1960)) commonly refers to these devices as "observers"; however, we 
feel that the term “estimator” is much more descriptive of their function because “observe” implies 
a direct measurement. In this book the term “estimator” is used but the reader can think of the terms 
interchangeably. 
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8.1.1 


and the characteristic equation of the system with the hypothetical control law is 


lzi-®+PK|=0. (8.7) 


Pole Placement 


The approach we wish to take at this point is pole placement; that is, having picked 
a control law with enough parameters to influence all the closed-loop poles, we 
will arbitrarily select the desired pole locations of the closed-loop system and see 
if the approach will work. Although this approach can often lead to trouble in the 
design of complex systems (see the discussion in Sections 2.6 and 8.3), we use it 
here to illustrate the power of full state feedback. In Chapter 9, we will build on 
this idea to arrive at a more practical design methodology. 

The control-law design, then, consists of finding the elements of K so that 
the roots of Eq. (8.7), that is, the poles of the closed-loop system, are in the 
desired locations. Unlike classical design, where we iterated on parameters in 
the compensator (hoping) to find acceptable closed-loop root locations, the full 
state feedback, pole-placement approach guarantees success and allows us to 
arbitrarily pick any pole location, providing that n poles are specified for an 
nth-order system. 

Given desired pole locations,’ say 


z = Bp Bas Pss- Bpo 


the desired control-characteristic equation is 


a (z) = (z — B,)(z — b) -z —B,) =0. (8.8) 


Equations (8.7) and (8.8) are both the characteristic equation of the controlled 
system; therefore, they must be identical, term by term. Thus we see that the 
required elements of K are obtained by matching the coefficients of each power 
of z in Eq. (8.7) and Eq. (8.8), and there will be n equations for an nth-order 
system. 


€ Example 8.1 


Pole Placement for Satellite Attitude Control 


Design a control law for the satellite attitude-control system described by Eq. (4.47). Pick the 
z-plane roots of the closed-loop characteristic equation so that the equivalent s-plane roots 
have a damping ratio of & = 0.5 and real part of s = —1.8 rad/sec (i.e., s = —1.8 + j3.12 
rad/sec). Use a sample period of T = 0.1 sec. 


3 Discussion of how one selects pole locations was reviewed in Section 2.6 for the continuous case and 
will occur through the following examples and in Section 8.3 for the discrete case. The results of the 
specification discussion in Chapter 7 can also be used to specify poles. 
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Solution. 


Example 4.11 showed that the discrete model for this system is 


LT T^ 
9-[i [| and Es] T | 
Using z = e^ with a sample period of T = 0.1 sec, we find that s = —1.8 + j3.12 rad/sec 


translates toz = 0.8 + j0.25, as shown in Fig. 8.1. The desired characteristic equation is then 


z? — 1.62 + 0.70 =0, 
and the evaluation of Eq. (8.7) for any control law K leads to 


(8.9) 
1 0 ry q 4/3 = 
[HS TRES T spe 
or 
2+ (TK, -(T?/2K, —2)z + (T?/2K, - TK, +1 =0. (8.10) 
Figure 8.1 


Desired root locations for satellite attitude-control system of Examples 8.1 and 8.4 


{> 


a S 


17 
a 
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z = plane loci of roots of constant $ and w, A control roots 

s=- t lon yi 2g? 

zc 
T = sampling period 


A estimator roots 
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Equating coefficients in Eqs. (8.9) and (8.10) with like powers of z, we obtain two simultaneous 
equations in the two unknown elements of K 
TK, +(T?/2)K, -2 = —16, 
(T?/2K, — TK, + 1 = 0.70, 
which are easily solved for the coefficients and evaluated for T — 0.1 sec 
0.10 0.35 


1 = 


control canonical form 


* 


The calculation of the gains using the method illustrated in the previous 
example becomes rather tedious when the order of the system (and therefore 
the order of the determinant to be evaluated) is greater than 2. A computer does 
not solve the tedium unless it is used to perform the algebraic manipulations 
necessary in expanding the determinant in Eq. (8.7) to obtain the characteristic 
equation. Therefore, other approaches have been developed to provide convenient 
computer-based solutions to this problem. 

The algebra for finding the specific value of K is especially simple if the 
system matrices happen to be in the form associated with the block diagram of 
Fig. 4.8(c). This structure is called “control canonical form" because it is so 
useful in control law design. Referring to that figure andtaking the state elements 
as the outputs of the delays (z~' blocks), numbered from the left, we get (assuming 

o = 0 for this case) 


a ag 1 
®=| 1 0 0 |}, ETS i 0h WSW Al 
1 0 0 


(8.11) 
Note that from Eq. (4.15), the characteristic polynomial of this system is a(z) = 
"d taz -F a4z + a,. The key idea here is that the elements of the first row of 
®_ are exactly the coefficients of the characteristic polynomial of the system. If 
we now form the closed-loop system matrix b, — I,K, we find 
—a,—K, —a,—K, —a,— K, 
® -T K= ] 0 0 : (8.12) 
0 l 0 
By inspection, we find that the characteristic equation of Eq. (8.12) is 
z + (a, + K,)2? + (a, + K,)z + (a, + K;) =0. 
Thus, if the desired pole locations result in the characteristic equation 


z? + az? +æ, +o = 0, 


8.1.2 
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then the necessary values for control gains are 


KR, K, 


= 0, — a), K, =a, —4,. (8.13) 
Conceptually, then, we have the canonical-form design method: Given an ar- 
bitrary (b, I) and a desired characteristic equation w(z) = 0, we convert (by 
redefinition of the state) (6b, I) to control form ($, T.) and solve for the gain 
by Eq. (8.13). Because this gain is for the state in the control form, we must, 
finally, express the result back in terms of the original state elements. This method 
is sometimes used by CACSD packages because of the numerical advantages; 
however, the transformation is transparent to the designer, who generally prefers 
to use a state definition that is related to the physical system's characteristics. 


Controllability 


The first question this process raises is existence: Is it always possible to find an 
equivalent (95. , I.) for arbitrary (®, I)? The answer is almost always “yes.” The 
exception occurs in certain pathological systems, dubbed “uncontrollable,” for 
which no control will give arbitrary pole locations. These systems have certain 
modes or subsystems that are unaffected by the control. Uncontrollability is 
best exhibited by a realization (selection of state elements) where each state 
element represents a natural mode of the system. If all the roots of the open-loop 
characteristic equation 


|z- 6|-0 


are distinct, then Eq. (8.3) written in this way (normal mode or “Jordan canonical 
form") becomes 


£ ad m r, 
emel ^ Hs as) vm ug (8.14) 
x(k + E X + 3 u 4 " 
0 0 z m : 
0 0 vx r 


n n 


and explicitly exhibits the criterion for controllability: No element in I' can be 
zero. If any l element was zero, the control would not influence that normal 
mode, and the associated state would remain uncontrolled. A good physical 
understanding of the system being controlled usually prevents any attempt to 
design acontroller for an uncontrollable system; however, there is a mathematical 
test for controllability applicable to any system description, which may be an 
additional aid in discovering this condition; a discussion of this test is contained 
in Section 8.7. 
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8.1.3 


Ackermann’s formula 


controllability matrix 


Pole Placement Using CACSD 


MATLAB has two functions that perform the calculation of K: place.m and 
acker.m. Acker is based on Ackermann's formula [Ackermann (1972)] and is 
satisfactory for SISO systems of order less than 10 and can handle systems with 
repeated roots. The relation is 


K-[0.- 1][r er er .. e7"r]'e() (15) 


where C = [T &qQL...]1scalled the controllability matrix, n is the order of the 
system or number of state elements, and we substitute ® for z in o (z) to form 


a (D) =" +a d^ o, d? E... ro T (8.16) 
where the @,’s are the coefficients of the desired characteristic equation, that is, 
a (z) —|zI- $ - TK|2z' +a"! + +a, (8.17) 


The controllability matrix, C, must be full rank for the matrix to be invertible and 
for the system to be controllable. 

place [Kautsky, Nichols, and Van Dooren (1985)] is best for higher order 
systems and can handle MIMO systems, but will not handle systems where the 
desired roots are repeated. 

Note that these functions are used for both the continuous and discrete cases 
because they solve the same mathematical problems given by Eqs. (2.37) and 
(8.7). The only difference is that the desired root locations are in substantially 
different locations for the s-plane and z-plane and that F, G have been replaced 
by ® and T. 


$ Example 8.2 


Control Law Pole Placement with MATLAB 


Design a control law for the satellite attitude-control system as in Example 8.!. Place the 
z-plane closed-loop poles at z = 0.8 + 70.25. 
Solution. The MATLAB statements 

Tet 

Phi = [1 T;0 1] 

Gam = [T?/2;T] 

p = [.8+i* .25;.8-i*.25] 

K = acker(Phi,Gam,p) 
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result in K = [10.25 3.4875]. The difference between these values and those shown in 
Example 8.1 are due to round-off error in the hand calculation. place would have given the 
same answer. 


* 


A more complex system demonstrates the kind of difficulty you might en- 
counter in using the pole placement approach. The specific difficulty is brought 
about by the necessity to pick n desired pole locations. Where should the higher 
frequency poles be picked? The system specifications typically help the designer 
pick only two of the desired poles. As discussed in Section 2.6, it is helpful 
to move poles as little as possible in order to minimize the required amount of 
control effort and to avoid exciting the system any more than necessary. The 
following example specifically illustrates how a poor choice of desired poles 
can cause an undesirable response, and how a wise choice of desired poles can 
drastically improve the situation. 


€ Example 8.3 


Pole Placement for a 4th-Order System with MATLAB 


Design a control law for the double mass-spring system in Appendix A.4 using d as the 
measurement. This system is representative of many systems where there is some flexibility 
between the measured output and control input. Assume the resonant mode has a frequency 
w, = | rad/sec and damping ¢ = 0.02 and select a 10:1 ratio of the two masses. The parameters 
that provide these characteristics are: M = 1 kg, m = 0.1 kg, b = 0.0036 N-sec/m, and k = 
0.091 N/m. Pick the sample rate to be 15 times faster than the resonance and show the free 
response to an initial condition of d = 1 m for two cases: 


(a) Pick all the poles at z — 0.9. 
(b) Pick the poles at z = 0.9 + j0.05, 0.8 + j0.4. 


Discuss why the response is better for case (b). 


Solution. From Eq. (A.17) we can write the state-space description as 


x-[d d y 5] H = [1000] 
0 ] 0 0 0 
—0.91 —0.006 0.91 0.036 0 

=| ut 6 0 SA T* | ole sae 
l 


0.091 0.0036 —0.091 —0.0036 


The sample rate should be 15 rad/sec which translates to approximately T = 0.4 secs. 


(a) All poles at z = 0.9, the MATLAB script 
SysC = ss(F,G,H,J) 
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Figure 8.2 

Initial condition response 
for Example 8.3; 

(a) desired poles all at 

z — 0.9, and (b) desired 
poles at z — 


0.9 + j0.05, 0.8 + j0.4 


(b) 


d andu 


(b) 


sysD = c2d(sysC, T,‘zoh’) 
p =[.9;.9;.9;.9] 
[phi,gam,H,J] = ssdata(sysD) 
K=acker(phi,gam,p) 
results in the feedback gain 
K = [0.650 —0.651 — 0.645 0.718). (8.18) 


place cannot be used for case (a) because of the repeated roots. For the response to an 
initial condition, the script 


sysCL = feedback(K*sysD, 1) 
Xo = [1;0;0;0] 
d = initial(sysCL, Xo) 
produces the closed-loop response for d(0) = 1 m, shown in Fig. 8.2(a). It exhibits a 


response that is much larger than that of the initial condition, but the time characteristics 
are consistent with the selected poles. 


For the desired poles at z = 0.9 + j0.05, 0.8 + j0.4, we modify the script above with 


p = [.9+i*.05;.9-i*.05;.8+i*.4;.8-i*.4] 


ARES 2S se a aa =e EL ae a 
[ 1 [ ] t 

SE a i = =i dius um f CONG aei mim [emipm---- 
' 1 ] | ' i D 

B aS EC tec Se pe a a a Rees = SSS 
| 1 ! I i I [ 

10b ----+----- +----- hRe----H----+- toe hKe---H--- -- 


Time (sec) 
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and this results in the feedback gain 
K = [-0458 — 0.249 0.568 0.968], (8.19) 


which produces the response to an initial condition, d = 1 m, shown in Fig. 8.2(b). It 
exhibits much less response of d with no increase in control effort, although the resonant 
mode oscillations did influence the response with damping consistent with the poles 
selected. The primary reason for the superior response is that the oscillatory mode was 
not changed substantially by the control. Two of the selected poles (z = 0.8 + j0.4) have 
a natural frequency w, = l rad/sec with a damping ¢ = 0.2. Therefore, the control is not 
attempting to change the natural frequency of the resonant mode at all; rather, the only 
task for the control in modifying this pole is to increase its damping from ¢ = 0.02 to 
¢ = 02. Since the mode remains lightly damped, its oscillations are still visible on the 
output. The selected poles at z = 0.9 + j0.05 affect the overall motion of the system and 
their placement is less critical. Generally, pole selections with a damping Z = 0.7 give a 
better balance between system response and control usage. The control is clearly much 
more effective with these desired pole locations. 


controller pole selection 


[4 


8.2 


* 


So we see that the mechanics of computing the control law are easy, once 
the desired pole locations are known. The trick is to pick a good set of poles! The 
designer should iterate between pole selections and some other system evaluation 
to determine when the design is complete. System evaluation might consist of an 
initial-condition time response as shown in the example, a step response, steady- 
state errors, gain and phase margins, or the entire frequency-response shape. Pole 
placement by itself leaves something to be desired. But it is useful as a design tool 
to be used in conjunction with the other design methods discussed in Chapter 7 
or as a part of an optimal design process that will be discussed in Chapter 9. 


Estimator Design 


The control law designed in the last section assumed that all state elements were 
available for feedback. Typically, not all elements are measured; therefore, the 
missing portion of the state needs to be reconstructed for use in the control law. 
We will first discuss methods to obtain an estimate of the entire state given a 
measurement of one of the state elements. This will provide the missing elements 
as well as provide a smoothed value of the measurement, which is often contam- 
inated with random errors or “noise.” There are two basic kinds of estimates of 
the state, x(k): We call it the current estimate, X(k), if based on measurements 
y(k) up to and including the kth instant; and we call it the predictor estimate, 
x(k), if based on measurements up to y(k — 1). The idea eventually will be to let 
u = —KX or u = —KX, replacing the true state used in Eq. (8.5) by its estimate. 
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8.2.1 


Figure 8.3 
Open-loop estimator 


Prediction Estimators 


One method of estimating the state vector which might come to mind is to 
construct a model of the plant dynamics, 


x(k + 1) = x(k) + Tuk). (8.20) 


We know ®, T, and u(k), and hence this estimator should work if we can obtain 
the correct x(0) and set X(0) equal to it. Figure 8.3 depicts this “open-loop” 
estimator. If we define the error in the estimate as 


X-x-k, (8.21) 


and substitute Eqs. (8.3) and (8.20) into Eq. (8.21), we find that the dynamics of 
the resulting system are described by the estimator-error equation 


x(k + 1) = Óx(K). (8.22) 


Thus, if the initial value of x is off, the dynamics of the estimate error are those 
of the uncompensated plant, ®. For a marginally stable or unstable plant, the 
error will never decrease from the initial value. For an asymptotically stable 
plant, an initial error will decrease only because the plant and estimate will both 
approach zero. Basically, the estimator is running open loop and not utilizing 
any continuing measurements of the system’s behavior, and we would expect 
that it would diverge from the truth. However, if we feed back the difference 
between the measured output and the estimated output and constantly correct the 
model with this error signal, the divergence should be minimized. The idea is to 
construct a feedback system around the open-loop estimator with the estimated 
output error as the feedback. This scheme is shown in Fig. 8.4; the equation for 
it is 


x(k + 1) = x(k) + T'u(k) + L [y(k) — Hx(k)], (8.23) 


Figure 8.4 


Closed-loop estimator 


estimator error equation 
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where L, is the feedback gain matrix. We call this a prediction estimator because 
a measurement at time k results in an estimate of the state vector that is valid at 
time k + 1; that is, the estimate has been predicted one cycle in the future. 

A difference equation describing the behavior of the estimation errors is 
obtained by subtracting Eq. (8.23) from Eq. (8.3). The result is 


x(k + 1) = [P — L H]X(&). (8.24) 


This is a homogeneous equation, but the dynamics are given by [® — LH]; and 
if this system matrix represents an asymptotically stable system, X will converge 
to zero for any value of x(0). In other words, x(k) will converge toward x(k) 
regardless of the value of x(0) and could do so faster than the normal (open-loop) 
motion of x(k) if the estimator gain, L,» were large enough so that the roots of 
$- L,H are sufficiently fast. In an actual implementation, x(k) will not equal 
x(k) because the model is not perfect, there are unmodelled disturbances, and the 
sensor has some errors and added noise. However, typically the sensed quantity 
and L can be chosen so that the system is stable and the error is acceptably small. 

To find the value of L ,, we take the same approach that we did when designing 
the control law. First, specify the desired estimator pole locations in the z-plane 
to obtain the desired estimator characteristic equation, 


(z — B,)(z — 8)---(z— B,) =9, (8.25) 


where the fs are the desired estimator pole locations’ and represent how fast 
the estimator state vector converges toward the plant state vector. Then form the 
characteristic equation from the estimator-error equation (8.24), 


| zI- ® + L,H| = 0. (8.26) 
4 The following sections discuss how one should select these poles in relation to the control poles and 


how both sets of poles appear in the combined system. The issue was also discussed for the continuous 
case in Section 2.6.2. 
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Equations (8.25) and (8.26) must be identical. Therefore, the coefficient of each 
power of z must be the same, and, just as in the control case, we obtain n equations 
in n unknown elements of L, for an nth-order system. 


€ Example 8.4 


Estimator Design for Satellite Attitude 


Construct an estimator for the same case as in Example 8.1, where the measurement is the 
position state element, x,, so that H = (1 0] as given by Eq. (4.47). Pick the desired poles of 
the estimator to be at z = 0.4 + j0.4 so that the s-plane poles have ¢ = 0.6 and c, is about 
three times faster than the selected control poles (see Fig. 8.1). 

Solution. The desired characteristic equation is then (approximately) 


z? — 0.8z + 0.32 = 0, (8.27) 


and the evaluation of Eq. (8.26) forany estimator gain L, leads to 
ki 50 LE T L 
= pl 
gn tl [ote] ie fee 


2 +4 (L,;— 22+ TL atl —L,;= 0. (8.28) 


or 


Equating coefficients in Eqs. (8.27) and (8.28) with like powers of z, we obtain two simulta- 
neous equations in the two unknown elements of L, 
Ly, —2= 0.8 
Pp 
TL,*1—L, 2032, 


which are easily solved for the coefficients and evaluated for T — 0.1 sec 
Loo I2 La = = 52. (8.29) 


Thus the estimator algorithm would be Eq. (8.23) with L, given by Eq. (8.29), and the 
equations to be coded in the computer are 


E(k + 1) = X, (k) +0.15, (k) + 0.005u(k) + 1.2[y(k) — X, 00] 
E(k + 1) = x) + 0. lu (k) + 5.21y(k) — x, (k)]. 


Figure 8.5 shows the time history of the estimator error from Eq. (8.24) for the gains in 
Eq. (8.29) and with an initial error of 0 for the position estimate and 1 rad/sec for the velocity 
estimate. 

The transient settling in X, could be hastened by higher values of the gains, L, that would 
result by selecting faster estimator poles, but this would occur at the expense of more response 
of both X, and X, to measurement noise. 


* 


Figure 8.5 

Time history of the 
prediction estimator 
error 
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Estimate Errors, X; and X? 


Time (sec) 


It is important to note that an initial estimator transient or, equivalently, the 
occurrence of an unmodelled input to the plant, can be a rare event. If the problem 
is one of regulation, the initial transient might be unimportant compared to the 
long-term performance of the estimator in the presence of noisy measurements. 
In the regulator case with very small plant disturbances, very slow poles (maybe 
even slower than the control poles) and their associated low estimator gains would 
give smaller estimate errors. Optimal selection of estimator gains based on the 
system's noise characteristics will be discussed in Chapter 9. 


Observability 


Given a desired set of estimator poles, is L, uniquely determined? It is, provided 
y is a scalar and the system is “observable.” We might have an unobservable 
system if some of its modes do not appear at the given measurement. For example, 
if only derivatives of certain states are measured and these states do not affect 
the dynamics, a constant of integration is obscured. This situation occurs with 
a 1/s? plant if only velocity is measured, for then it is impossible to deduce 
the initial condition of the position. For an oscillator, a velocity measurement 
is sufficient to estimate position because the acceleration, and consequently the 
velocity, observed are affected by position. A system with cycle delays can 
also be unobservable because the state elements representing the delays have 
no influence on the measurement and can therefore not be reconstructed by the 
measurements. A mathematical test for observability is stated in the next section 
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8.2.3 


observability matrix 


as a necessary condition for the solution of Ackermann’s formula; its proof is 
given in Section 8.7. 


Pole Placement Using CACSD 


If we take the transpose of the error-equation system matrix from Eq. (8.24) 
[P —L,H]’ = ^ —H’L!, 


we see that the result is the same form as the system matrix ® — IK of the 
control problem from Eq. (8.6), and the mathematics of the solution is the same. 
Therefore, to solve the problem, we substitute ®’ for $, H” for [and L; for K, 
and use the control-design results. Making the substitutions in Eq. (8.15) results 
in Ackermann's estimator formula 

-1 


H 0 
H® 0 
L,=a,@)| H9' T" (8.30) 
Ho"! | 
where 
a (b) =O" o," +a D E... Fo I, (8.31) 


and the œ,’s are the coefficients of the desired characteristic equation, that is 
a.(z2) —(z—B)0z—B8)--(z—B)-z' oz" 6m. (832) 


The coefficient matrix with rows HỌ” is called the observability matrix and 
must be full rank for the matrix to be invertible and for the system to be observable. 

For calculation of L, with MATLAB, either acker or place can be used by 
invoking the substitutions above. The same restrictions apply that existed for the 
control problem. 


@ Example 8.5 


Predictor Estimator Pole Placement with MATLAB 
Design an estimator for the satellite attitude-control system as in Example 8.4. Place the z-plane 
poles at z = 0.4+ j0.4. 
Solution. The MATLAB statements 
T= 
Phi = (1 TO 1] 
Gam = [T?/2;T] 
p = [.4+i*.4;.4-i*.4] 


8.2 Estimator Design 295 


Lp = acker(Phi’,H’,p)’ 


result in 


place would have given the same answer. 


8.2.4 


Current Estimators 


As was already noted, the previous form of the estimator equation (8.23) arrives 
at the state vector estimate x after receiving measurements up through y(k — 1). 
This means that the current value of control’ does not depend on the most current 
value of the observation and thus might not be as accurate as it could be. For high- 
order systems controlled with a slow computer or any time the sample periods are 
comparable to the computation time, this delay between the observation instant 
and the validity time of the control output can be a blessing because it allows 
time for the computer to complete the calculations. In many systems, however, the 
computation time required to evaluate Eq. (8.23) is quite short compared to the 
sample period, and the delay of almost a cycle between the measurement and the 
proper time to apply the resulting control calculation represents an unnecessary 
waste. Therefore, it is useful to construct an alternative estimator formulation that 
provides a current estimate X based on the current measurement y(k).° Modifying 
Eq. (8.23) to yield this feature, we obtain 


&(k) = x(k) + L,[y(k) — Hx(k)], (8.33) 


where x(k) is the predicted estimate based on a model prediction from the 
previous time estimate, that is 


x(k) = O&(k — 1) + Tu(k — 1). (8.34) 


Control from this estimator cannot be implemented exactly because it is impossi- 
ble to sample, perform calculations, and output with absolutely no time elapsed. 
However, the calculation of u(k) based on Eq. (8.33) can be arranged to minimize 
computational delays by performing all calculations before the sample instant that 
do not directly depend on the y(k) measurement. In Chapter 3, we also had a 
u (k) that was dependent on y(k) (see Table 3.1); and here, too, we organized the 
calculations to minimize the delay. If this latency in the implementation causes 
significant errors in the performance of the system compared to the analysis, 


5 We plan to use u (k) = —Kx in place of u(k) = —Kx. 
6 This form of the equations is used in the Kalman filter, which is discussed in Chapter 9. 
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it could be modeled and accounted for in the estimator equations by using the 
results of Section 4.3.4. 

To help understand the difference between the prediction and current form 
of estimation—that is, Equations (8.23) and (8.33)—it is useful to substitute 
Eq. (8.33) into Eq. (8.34). This results in 


x(k + 1) = dx(&) + Pu(k) + 9L [y(k) — Hx(k)]. (8.35) 


Furthermore, the estimation-error equation for x(k), obtained by subtracting 
Eq. (8.3) from Eq. (8.35), is 


x(k + 1) = [P — ®L_H)X(k). (8.36) 


By comparing Eqs. (8.35) with (8.23) and (8.36) with (8.24), we can conclude 
that x in the current estimator equation, (8.33), is the same quantity as x in the 
predictor estimator equation, (8.23), and that the estimator gain matrices are 
related by 


L, —4L. (8.37) 


The relationship between the two estimates is further illuminated by writing 
Eqs. (8.33) and (8.34) as a block diagram, as in Fig. 8.6. It shows that X and Xx 
represent different outputs of the same estimator system. 

We can also determine the estimator-error equation for X by subtracting 
Eq. (8.33) from (8.3). The result is’ 


x(k + 1) = [P — L_H®)x(k). (8.38) 


The two error equations, (8.36) and (8.38), can be shown to haye the same 
roots, as should be the case because they simply represent the dynamics of 
different outputs of the same system. Therefore, we could use either form as the 
basis for computing the estimator gain, L,. Using Eq. (8.38), we note that it is 
similar to Eq. (8.24) except that H® appears instead of H. To use Ackermann’s 
formula for L, we use Eq. (8.30) with H replaced by H® and find 


H^ | ro 
HỌ’ 0 

L =g, (0) | H® 0|. (8.39) 
Ho" i 


where a (o) is based on the desired root locations and is given by Eqs. (8.31) 
and (8.32). To use the control form of Ackermann’s formula to perform the 
calculations, we take the transpose of  — L, H«P and get ^ — ®'H’L!, which 
is the same form as the system matrix P — IK of the control problem. Therefore 


7 Equation (8.21) defines X to be x — x; however, we use x here to be x — x. In both cases, X refers to the 
estimator error. 


Figure 8.6 
Estimator block diagram 
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substitutions P" for ® and ^ H? for T yield L, instead of K. Alternatively, we 
could compute L, using Eq. (8.30) and then compute L, using Eq.(8.37); that is 


L-96L. (8.40) 


€ Example 8.6 


Current Estimator Pole Placement with MATLAB 
Design a current estimator for the satellite attitude-control system as in Examples 8.4 and 8.5. 
Place the z-plane poles at z = 0.4 + j0.4 and compare the error response to an initial error in 
the velocity with that obtained in Example 8.4. 
Solution. The MATLAB statements 

T=.1 

Phi = [1 T;0 1] 

Gam = [T^/2;T] 

p = [.4+i*.4;.4-i*.4] 

Lc = acker(Phi’,Phi’*H’,p)’ 


result in 


Therefore, the estimator implementation using Eq. (8.33) ina way that reduces the computation 
delay as much as possible is, before sampling 


X,(k) = 4, (k — 1) + 0.005 u(k — 1) + 0.1 £,(k — 1), 
X (k) = £ (k — 1) + O.1u(k — 1), 
x, = (1 — 0.68) ¥,(k), 
x, = (1 — 52A 9 X, 
and after sampling y(k) 
&,0) = xj + 0.68 y(k), 
£(k) = xj + 5.2 y(k), 
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at which time the state vector estimate is available for the calculation of the control u(x). 

Figure 8.7 shows the time history of the estimator error equation from Eq. (8.38), again 
with an initial error of O for the position estimate and 1 rad/sec for the velocity estimate. The 
figure shows very similar results compared to the prediction estimator in Fig. 8.5; however, 
the current estimator implementation exhibits a response which is about a cycle faster. 


Figure 8.7 
Time history of current 
estimator error 


* 


If ® is singular, as can happen with systems having time delays, neither 
Eqs. (8.39) nor (8.40) can be used. However, estimators can be designed for these 
systems as discussed in Section 8.6. 

Note that we now have two estimates that could be used for control purposes, 
the predicted estimate [X(k) from Eq. (8.23)] and the current estimate [£(k) from 
Eq. (8.33)]. The current estimate is the obvious choice because it is based on 
the most current value of the measurement, y. Its disadvantage is that it is out 
of date before the computer can complete the computation of Eqs. (8.33) and 
(8.5), thus creating a delay that is not accounted for in the design process, which 
will cause less damping in the implemented system than specified by the desired 
poles. The use of the predicted estimate for control eliminates the modeling error 
from the latency because it can be calculated using the measurement, y(k — 1), 
thus providing an entire sample period to complete the necessary calculations 
of u(k) before its value is required. Generally, however, one should use the 


> 
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current estimate because it provides the fastest response to unknown disturbances 
or measurement errors and thus better regulation of the desired output. Any 
deficiencies in the system response due to the latency from the computation lag 
that is found by simulation or experiment can be patched up with additional 
iterations on the desired pole locations or accounted for exactly by including 
computation delay in the plant model. 


Reduced-Order Estimators 


The estimators discussed so far are designed to reconstruct the entire state vector, 
given measurements of some of the state elements. One might therefore ask: 
Why bother to reconstruct the state elements that are measured directly? The 
answer is: You don't have to, although, when there is significant noise on the 
measurements, the estimator for the full state vector provides smoothing of the 
measured elements as well as reconstruction of the unmeasured state elements. 

To pursue an estimator for only the unmeasured part of the state vector, let 
us partition the state vector into two parts: x, is the portion directly measured, 
which is y, and x, is the remaining portion to be estimated. The complete system 
description, like Eq. (8.3), becomes 


x(k +1) = o o x, (k) b. 
E: +1) | k | $,, o, | | x,(k) | T l T, | x0. (8.41) 


(k) 
(=! 0 l Ta | (8.42) 
ý x, (k) 
and the portion describing the dynamics of the unmeasured state elements is 
X, (k + 1) = ®,,x,(k) + ®, x (k) + Tuk), (8.43) 
N ——— 
known "input" 


where the right-hand two terms are known and can be considered as an input into 
the x, dynamics. If we reorder the x, portion of Eq. (8.41), we obtain 


x (k 4- 1) — 6, x, (k) — T u (k) = x, (X). (8.44) 


aa a 
known “measurements” 
Note that this is a relationship between a measured quantity on the left and the 


unknown state vector on the right. Therefore, Eqs. (8.43) and (8.44) have the 
same relationship to the state vector x, that the original equation, (8.3), had to the 


8 Reduced-order estimators (or observers) were originally proposed by Luenberger (1964). This 
development follows Gopinath (1971). 
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entire state vector x. Following this reasoning, we arrive at the desired estimator 
by making the following substitutions 


X 4— €. 
$9 <= Dy 
Pulk) <— 9,x (k) Y ',u(&), 
NO s— a kt =O a ulk); 
H — 0. 


into the prediction estimator equations (8.23). Thus the reduced-order estimator 
equations are 


& (kc 1) = OF) + 6, x 0) - T,u(k) 


+L [x,k 4-1) — 6,x,0) -T,u() - R00). (859 
Subtracting Eq. (8.45) from (8.43) yields the error equation 
X,(k +1) = [6,, — L, 5, x, (k), (8.46) 
and therefore L, is selected exactly as before, that is, (a) by picking roots of 
Izl- 6, +L,®,,|=a,(z) =0 (8.47) 
to be in desirable locations, or (b) using Ackermann's formula 
o, ih 0 
PpP b 0 
L, =a,(®,,) | 9.49. T*7 (8.48) 
; 0 
$5 1 


We note here that Gopinath (1971) proved that if a full-order estimator as given 
by Eq. (8.23) exists, then the reduced-order estimator given by Eq. (8.45) also 
exists; that is, we can place the roots of Eq. (8.47) anywhere we choose by choice 
ofL. 


@ Example 8.7 


Reduced-Order Estimator for Satellite Attitude 


Determine a reduced-order estimator for the same case as in Examples 8.4 and 8.5. 


Solution. We start out by partitioning the plant equations to fit the mold of Eqs. (8.41) and 


(8.42). This results in 
Es es |=[ 4 3 Le Jd e REP | 
o, 9, GT? n, T 0.1 


x, |, | the measured position state y, 8.49 
— | the velocity to be estimated (8:43) 


8.2 Estimator Design 301 


where o o and so on, are all scalars. Therefore D is a scalar also, and there is only one 
estimator pole to pick, the pole corresponding to the speed at which the estimate of scalar 
velocity converges. From Eq. (8.47) we pick L, from 


a a p 


For this estimator to be about the same speed as the two previous estimator examples, which 
had two poles at z = 0.4 + j0.4, we will pick the pole at z = 0.5; therefore L, T — 1 = —0.5 
and L, = 5. The estimator equation, (8.45), is 


5, (k) = X, (k — 1) +0.1 u(k — 1) 
+5.0 [y(k) — y(k — 1) — 0.005 u(k — 1) ~ (0.1)2, (k — 1)]. (8.50) 
The implementation in a control computer would, before sampling, look something like 
x’ = 0.5, (k — 1) + 0.075 u(k — 1) - 5 y(k — 1), 
and after sampling 
2 (k) =x’ + Sy(k). 


Figure 8.8 shows the time history of the estimator-error equation (8.46) with an initial (velocity) 
estimate error of | rad/sec. The figure shows very similar results compared to the velocity 
element estimates in Figs. 8.5 and 8.7. Of course, there is no position estimate because this 
formulation assumes that the measurement is used directly without smoothing. 


Figure 8.8 
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8.3 


8.3.1 


Regulator Design: Combined Control Law 

and Estimator 

If we take the control law (Section 8.1) and implement it, using an estimated 
state vector (Section 8.2), the control system can be completed. A schematic of 
such a system is shown in Fig. 8.9. However, because we designed the control 
law assuming that the true state, x, was fed back instead of X or x, it is of interest 
to examine what effect this has on the system dynamics. We will see that it has 


no effect! The poles of the complete system consisting of the estimator feeding 
the control law will have the same poles as the two cases analyzed separately. 


The Separation Principle 
The control is now 
u(k) = —Kx(k) 
and the controlled plant equation (8.6) becomes 
x(k + 1) = $x(k) — TKX (k), (8.51) 
which can also be written in terms of the estimator error using Eq. (8.21) 
x(k + 1) = ®x(k) — TK (x(k) — X(k)). (8.52) 


Combining this with the estimator-error equation (8.24)? we obtain two coupled 
equations that describe the behavior of the complete system” 


Xk-D| |9-—-LJH 0 x(k) (8.53) 
x(k+1) | TK ®-TK }| x(k) J 
The characteristic equation is 
zI-Ď+L H 0 
TK ' aA-®+PK |^: pog 


which, because of the zero matrix in the upper right, can be written as 
r- e L,B|r- e«TK| = oo, =0. (8.55) 


In other words, the characteristic poles of the complete system consist of the 
combination of the estimator poles and the control poles that are unchanged 
from those obtained assuming actual state feedback. The fact that the combined 
control-estimator system has the same poles as those of the control alone and the 


9 We show only the prediction estimator case. The other estimators lead to identical conclusions. 


10 This description of the entire system does not apply if the estimator model is imperfect; see Section 
11.5 for an analysis of that case. 


Figure 8.9 
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Estimator and controller mechanization 


Estimator 


x(k*1) 2 e&x(k) +T u(k) 
+L, Dk) — Hx(k) 


or Eq. (8.33) or (8.45) 


compensator 


estimator alone is a special case of the separation principle by which control and 
estimation can be designed separately yet used together. 

To compare this method of design to the methods discussed in Chapter 7, we 
note from Fig. 8.9 that the portion within the dashed line corresponds to classical 
compensation. The difference equation for this system or "state-space designed 
compensator” is obtained by including the control feedback (because it is part 
of the compensation) in the estimator equations. Using Eq. (8.23) yields for the 
prediction estimator 


x(k) = [b -TK — L,Hix(k — 1) + L,y(k — 1), 
u(k) = —Kx(k), (8.56) 
and using Eq. (8.33) yields for the current estimator 
K(k) = [P — FK - L'HO + L HK ]x(k — 1) + L y(&), 


u(k) = —Kx(k). (8.57) 
The poles of the compensators above are obtained from, for Eq. (8.56), 
IzI- 6 - TK -L,H| —0 (8.58) 


and, for Eq. (8.57), 
IzI- 6 - TK + L,H® - L HF K | = 0, (8.59) 


andare neither the control law poles, Eq. (8.8), nor the estimator poles, Eq. (8.25). 
These poles need not be determined during a state-space design effort, but can 
be of interest for comparison with compensators designed using the transform 
methods of Chapter 7. 

If desired, Eq. (8.56) can be converted to transfer function-form using the 
same steps that were used in arriving at Eq. (4.64). This results in what was called 
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estimator pole selection 


compensation in Chapter 7 and usually referred to as D(z). For the prediction 
estimator, we find 
eh. = ip ie) = Ki d PRL E) (8.60) 
Y(z) P P p 
The D(z) could also be found from Eq. (8.56) by using tf.m in MATLAB or, for 
the current estimator, from Eq. (8.57). Likewise, the transfer function for the 
reduced-order compensator is found by using the measured part of the state, x , 
directly in the control law and the estimated part, %,(k), for the remainder. Thus, 
the control gain K needs to be partitioned, so that 


u(k) =[K, SIN where K=[K, K,]. (&61) 
b 


In the previous sections, we developed techniques to compute K and L 
(which define the compensation), given the desired locations of the roots of the 
characteristic equations of the control and the estimator. We now know that these 
desired root locations will be the closed-loop system poles. The same meter sticks 
that applied to the classical design and were discussed in Section 7.2 also apply 
to picking these poles. In practice, when measurement noise is not an issue, it is 
convenient to pick the control poles to satisfy the performance specifications and 
actuator limitations, and then to pick the estimator poles somewhat faster (by a 
factor of 2 to 4) so that the total response is dominated by the response due to the 
slower control poles. It does not cost anything in terms of actuator hardware to 
increase the estimator gains (and hence speed of response) because they appear 
only in the computer. The upper limit to estimator speed of response is based on 
the behavior of sensor-noise rejection, which is the subject of Chapter 9. 

In order to evaluate the full system response with the estimator in the loop, 
it is necessary to simulate both the real system and the estimator system as was 
formulated in Eq. (8.53). However, it is easier to see what is going on by using x 
or X in place of x. The result for the predictor case using Eq. (8.56) is 


xk+D) | _ o -IK x(k) (8.62) 
K(kK+1) | ~ LL,H p -TK-LjJH x(k) |’ i 
the result for the current estimator using Eq. (8.57) is 
ns 19. | _ o -IK x(k) (8.63) 
xk-1)]| ~ | LH® &®-TK-L,H® x(k) |' à 


and the result for the reduced-order estimator using Eq. (8.45) is 


x(k+1) ] _ [ ®-I[ K, 0] -TK x(k) 
eso] = [OTE Belso) e 


where 


A=LH®+, H — l',KaH — L,® H 
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and 
B= 9, -D,K,-L 9,, 


and where K, and K, are partitions of K according to the dimensions of x, and x,. 


€ Example 8.8 


Compensation Based on the Predictor Estimator 


Put together the full feedback control system based on the calculations already done in Exam- 
ples 8.1 and 84, that is, using the prediction estimator. Use the control gain, K — [10 3.5], 
and the estimator gain, p = [1.2 5.2] Determine the D(z) for comparison with a classical 
lead compensation. Plot the response of the system variables for an initial plant velocity of 
— | rad/sec and zero for all other initial conditions. Comment on whether the responses are 
consistent with your expectations. 


Solution. The compensation equations consist of Eq. (8.56) with the values of K and L 
plugged in and, being in difference-equation form, can be coded directly in a control computer. 
To find the transfer function form, we use zp.m (or Eq. (8.60)), to find that! 


z — 0.825 
z —02 + j0.557 


There is a compensator zero near the two plant poles at z = +1 and there are two compensator 
poles considerably to the left. This is very similar to a classical lead compensator except that 
it has two poles instead of one. State-space design using a full-order estimator will always 
produce compensation that is the same order as the plant. Note that the difference equation 
that results from this D(z) will have a one cycle delay between the input and output. 

Figure 8.10 shows the response of the system and controller to the initial conditions. This 
could be thought of as a condition that would result from a sudden disturbance on the system. 
Note the estimator error at the beginning which decays in about 0.7 sec, consistent with the 
estimator poles. The overall system response is slower and has a settling time of about 2.5 sec, 
consistent with the control poles. 


D,(z) = —30.4 (8.65) 


* 


€ Example 8.9 


Compensation Based on the Current Estimator 


Put together the full feedback control system based on the calculations already done in Ex- 
amples 8.1 and 8.5, that is, using the current estimator. Use the control gain, K = [10 3.5], 
and the estimator gain, LT = [0.68 5.2]. Determine the D(z) for comparison with a classical 
lead compensation. Plot the response of the system variables for an initial plant velocity of 
—] rad/sec and zero for all other initial conditions. Comment on whether the responses are 
consistent with your expectations. 


1] Equation (8.60) includes a minus sign because it is the transfer function from Y (s) to U (s) rather than 
from E(s), as is the normal convention used in Chapter 7. 
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Figure 8.10 

Time histories of 
controlled system with 
prediction estimator, 
Example 8.8 


Time (sec) 


Solution. The compensation equations consist of Eq. (8.57) and, with the values of K and 
L, plugged in, we find that 


z(z — 0.792) 
z — 0.265 + j0.394° 


This compensation is very much like that from the prediction estimator; however. because of 
the extra z in the numerator, there is no | cycle delay between input and output. This faster 
cycle response required less lead from the compensation, as exhibited by the zero being further 
from z = 1. 

Figure 8.11 shows the response of the controlled system to the initial conditions. Note 
the somewhat faster response as compared to Fig. 8.10 due to the more immediate use of the 
measured signal. 


D.(z) = —25.1 (8.66) 


* 


€ Example 8.10 


Compensation Based on the Reduced-Order Estimator 


Put together the full feedback control system based on the calculations already done in Exam- 
ples 8.1 and 8.6, that is, using the reduced-orderestimator. Use the control gain, K = [10 3.5], 
and the estimator gain, L7 = 5. Determine the D(z) for comparison with a classical lead com- 
pensation. Plot the response of the system variables for an initial plant velocity of —1 rad/sec 
and zero for all other initial conditions. Comment on whether the responses are consistent with 
your expectations. 

Also, use the D(z) to construct a root locus for this system and show where the desired 
root locations for the control and estimation lie on the locus. 


Figure 8.11 

Time histories of 
controlled system with 
current estimator, 
Example 8.9 


8.3 Regulator Design: Combined Control Law and Estimator 307 


0 0.5 l 1:5 2 2:5 3 
Time (sec) 


Solution. The compensation equations consist of Eqs. (8.50) and (8.61). With the values of 
K and L, plugged in, we find after much algebra that 


z — 0.8182 
z — 0.2375 ` 

Figure 8.12 shows the response of the system to the initial conditions. It is very similar 
to that of Example 8.9; the only notable difference is the first-order response of the estimator 
error, which slightly reduced the control usage. 

This compensation now looks exactly like the classic lead compensation that was used 
often in Chapter 7 and would typically be used for a 1/s? plant. A sketch of a root locus vs. K 
is given in Fig. 8.13. For this design a gain of 27.7 is now the variable K. 

The closed-loop root locations corresponding to K = 27.7 are indicated by the triangles 
and lie on the two control roots at z = 0.8 + j0.25 and on the one estimator root at z = 0.5, 
as they should. 


D, (2) = -27.7 (8.67) 


: * 


The higher order compensators that resulted in Examples 8.8 and 8.9 have the 
benefit of more attenuation at high frequencies, thus reducing the sensitivity to 
measurement noise. This also follows from the fact that they provided a smoothed 
value of the measured output as well as reconstructing the velocity state variable. 
Full order estimators are also easier to implement because of the simpler matrix 
equations that result using u = —Kx rather than the partitioned Eq. (8.61). As a 
result, reduced-order estimation is not often used in practice. 
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Figure 8.12 

Time histories of system 
with reduced-order 
estimator, Example 8.10 
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Figure 8.13 1 
Sketch of the root locus 
for Example 8.10 0.8 
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8.3.2 Guidelines for Pole Placement 


The selection criteria of the closed-loop control and estimator poles (or roots) 
have been encountered throughout the examples in Chapter 8 to this point. Also 
see the review of state-space design for continuous systems in Section 2.6 as well 
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as Franklin, Powell, and Emami-Naeini (2019), Sections 7.6 and 7.7.3. The key 
idea for control-pole selection is that one needs to pick the poles so that the 
design specifications are met while the use of control is kept to a level that is no 
more than needed to meet the specifications. This pole-selection criterion will 
keep the actuator sizes to a minimum, which helps to minimize the cost and 
weight of the control system. The relationships between various system 
specifications developed in Section 7 .1 can be used as an aid in the 
pole-selection process. For high-order systems, it is sometimes helpful to use 
the ITAE or Bessel prototype design root locations as discussed in Section 7.6 in 
Franklin, Powell, and Emami- Naeini (2019). For the case where there is a 
lightly damped open-loop mode, a technique that minimizes control usage is 
simply to add damping with little or no change in frequency, a technique called 
radial projection that was demonstrated in Example 8.3. 

The optimal design methods discussed in Chapter 9 can also be used to select 
pole locations. They are based on minimizing a cost function that consists of the 
weighted sum of squares of the state errors and control. The relative weightings 
between the state errors and control are varied by the designer in order to meet 
all the system specifications with the minimum control. Optimal methods can be 
applied to the SISO systems, which are the subject of this chapter, or to MIMO 
systems. 

Estimator-error pole selection is a similar kind of design process to the 
control-pole selection process; however, the design trade-off is somewhat differ- 
ent. Fast poles in an estimator do not carry the penalty of a large actuator like 
they do in the control case because the large signals exist only in the computer. 
The penalty associated with fast estimator poles is that they create an increased 
sensitivity between sensor errors and estimation errors. 

The key idea for estimator-error pole selection is that the estimation errors 
should be minimized with respect to the prevailing system disturbances and 
sensor noise. It is also convenient to keep the estimator poles faster than the 
control poles in order that the total system response is dominated by the control 
poles. Typically, we select well-damped estimator poles that are two to six times 
faster than the control poles in order to provide a response dominated by the 
control poles. For cases where this criterion produces estimation errors due to 
sensor noise that are unacceptably large, the poles can be slowed down to be less 
than two times the control poles; however, in this case the total response could 
be strongly influenced by the location of the estimator poles, thus coupling the 
estimator design with the control design and complicating the process. 

In the optimal estimation discussion in Chapter 9 we will see that the optimal 
estimator error poles are proportional to the ratio between the plant model errors 
and the sensor errors. For an accurate plant model with small disturbances but 
large sensor errors, the optimal estimation is achieved with very low estimator 
gains (slow response) because the estimator is best served by relying primarily 
on the plant model. On the other hand, a system with a plant model that includes 
the possibility of large disturbances but with an accurate sensor achieves the best 
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8.4 


8.4.1 


Figure 8.14 

Block diagram for 
full-state feedback with 
reference input 


estimation by using a large estimator gain (fast response) in order to use the 
sensed information to correct the model errors as quickly as possible. 


Introduction of the Reference Input 


The compensation obtained by combining the control law studied in Section 
8.1 with any of the estimators of Section 8.2 is a regulator design in that the 
goal was to drive all states to zero. We designed the characteristic equations 
of the control and the estimator to give satisfactory natural mode transients to 
initial conditions or disturbances, but no mention was made of how to structure 
a reference input or of the considerations necessary to obtain good transient 
response to reference inputs. To study these matters we will consider first how 
one introduces a reference input to the full-state feedback case, and then we will 
proceed to the case with estimators. We then turn to a discussion of output error 
command, a structure that occurs when the sensor is capable of providing only 
an error signal, for example, an attitude error from a gyro or the pointing error 
from a radar signal. The output error structure is also the one that results if one 
is designing the compensation using transfer-function methods and the reference 
input is structured according to Fig. 7.5, the typical case. It is, therefore, of 
interest to study this structure in order to understand the impact of its use on the 
dynamic response of the system. In conclusion, we will discuss the implications 
of this section’s results and compare the relative advantages of the structure 
made possible by the state-space control/estimation approach with the classical 
approach. 


Reference Inputs for Full-State Feedback 


Let us first consider a reference input for a full-state feedback system as in 
Eq. (8.5). The structure is shown in Fig. 8.14 and consists of a state command 
matrix N, that defines the desired value of the state, x,. We wish to find N, so 
that some system output, y, — H x, is at a desired reference value. This desired 
output, y,, might not be the same quantity that we sense and feed to an estimator 
that has been called y and determined by H in the previous two sections. 
Although so far in this book we have only considered systems with a single 
control input and single output (SISO), Chapter 9 considers the case of more 
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than one input and output (MIMO), and, therefore, we will allow for that in the 
development here and in the following subsection. We will, however, require 
that the number of inputs in u and desired outputs in y, be the same." The 
basic idea in determining N, is that it should transform the reference input, r, 
to a reference state that is an equilibrium one for that r. For example, for a step 
command to Example 8.1, NT —[1 OJ]:thatis, we wish to command the position 
state element, but the velocity state element will be zero in steady state. For the 
double mass-spring system of Example 8.3, if we desire that d = r—that is, 
that H, — [1 0 O0 0O0]—then we should set NT —[1 O 1  O) because that 
provides a state reference, x , that, if matched by the actual state, is at equilibrium 
for the desired output. 
More specifically, we have defined N, so that 


N.r—x, and | u- -K(x- x). (8.68) 


If the system is Type 1 or higher and r is a step, there will be no steady-state 
error, and the final state 
x(oo) = x, =X,. 


r 


For Type 0 systems, there will be an error because some control is required to 
maintain the system at the desired x . 

Often the designer has sufficient knowledge of the plant to know what the 
equilibrium state is for the desired output, in which case the determination of N, 
is complete. For complex plants, however, this can be difficult. In these cases, it 
is useful to solve for the equilibrium condition that satisfies y, — r.? 

In order for the solution to be valid for all system types, whether they require 
a steady-state control input or not, we will include the possibility of a steady-state 
control term that is proportional to the reference input step, that is, 


u, = N,r, (8.69) 


as shown in Fig. 8.15(a). The proportionality constant, N,, will be solved for in 
the formulation. 

If the resulting u, is actually computed and implemented in the reference 
input structure, we refer to it as “feedforward,” but the feedforward component 
of the input is often not used. Instead, the preferred method of providing for zero 
steady-state error is through integral control or bias estimation, which essentially 
replaces the u,. in Fig. 8.15(a) with an integral that is an estimate of the steady- 
state control, a topic that is the subject of Section 8.5. In some cases, it is difficult 
to achieve a high enough bandwidth when replacing feedforward with integral 
control; therefore, feedforward is sometimes used to reduce the demands on the 
integral so that it need only provide the error in the feedforward control, thus 


12 This is the only case that has a unique and exact answer, although other situations have been studied. 


13 See Trankle and Bryson (1978) for a more complete discussion and extensions of the ideas to inputs 
other than steps, often called "model following.” 
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Figure 8.15 

(a) Block diagram and 
(b) modified block 
diagram for full-state 
feedback with reference 
input and feedforward 


speeding up the system response. On the other hand, if the system is Type 1 or 
higher, the steady-state value of control for a step will be zero, and the solution 
will simply give us N, = 0 and N,, which defines the desired value of the 


state, x . 
* 
Continuing on then, the steady state requirements for the system are that 


N,r = X, = Xs’ 
Hx,-y,-r, (8.70) 


which reduce to 
HNr-r and HN,-I (8.71) 
Furthermore, we are assuming the system is at steady state; therefore, 
x(k + 1) = $x(&) + Puck) > x, = Ox, + Tu, 

or 

(® — Dx, + Tu,, = 0; 
and, from Eqs. (8.69) and (8.70) 

(® — DN,r - I'N,r — 0, 
which reduces to 


(b — DN, TN, — 0. (8.72) 
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Collecting Eqs. (8.71) and (8.72) into one matrix equation 


FSDIPIUH 


yields the desired result 


N, $-1r]'[o 
Ix]-[^wel [1] is 
It is also possible to enter the reference input after the gain multiplication 
according to Fig. 8.15(b) by combining N, and N, according to 
N =N, +KN.,. (8.74) 


Calculation of N , and N, can be carried out by the MATLAB function refi.m 
contained in the Control Toolbox. 


@ Example 8.11 


Reference Input for the Mass-Spring System 


Compute the reference input quantities for the system of Example 8.3 where it is desired to 
command d to a new value. Compare the two structures (a) and (b) in Fig. 8.15. 


Solution. The state was definedtobex={d d y y]". Therefore, to command a desired 
valueofd, H, 2 [1 0 0 0], and the evaluation of Eq. (8.73) leads to NT zL 10 3 0 
and N, = 0, as expected by inspecting the physical system shown in Fig. A.8. The fact that 
N, = 0 makes sense because this system is in equilibrium without the need of any control as 
long as d = y, which is ensured by the resulting value of N,. In fact, N, will be zero for any 
system of Type 1 or higher, as already discussed. The elements of N, do not depend on specific 
values of any parameters in the plant and are therefore not sensitive to modeling errors of the 
plant. 

Use of Eq. (8.74) leads to N = 0.005 using the K from Eq. (8.18) and N = 0.011 using 
the K from Eq. (8.19). However, note that this input structure can be very sensitive to errors in 
K. In this particular example, N is the result of a difference of two numbers (K, and K,) that 
are close to each other in absolute value, extremely close for the first case (which also exhibited 
poor response), thus producing an extreme sensitivity. Specifically, if one of the elements of K 
in Eq. (8.18) were in error by 1%, the resulting error in N would be 120%! To avoid the high 
sensitivity for cases like this, it is advisable to structure the reference input as in Fig. 8.15(a). 


* 


This example shows that there are some systems where it is better to use the 
structure of Fig. 8.15(a). However, most cases do not exhibit this sensitivity and 
Fig. 8.15(b) is preferred due to its simplicity. 


14 This cannot be solved if the plant has a zero at z = L. 
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8.4.2 


Figure 8.16 

Block diagrams for the 
best reference input 
structure with 
estimators: the state 
command structure. 
(a) as derived, 

(b) simplified 


If the system in the example had been Type 0, we would have found that 
N, was nonzero and that its value was inversely proportional to the plant gain. 
However, the plant gain can vary considerably in practice; therefore, designers 
usually choose not to use any feedforward for Type O systems, relying instead on 
the feedback to keep errors acceptably small or by implementing integral control 
as discussed in Section 8.5. If this is the desired action for a control design using 
state space, the designer can simply ignore the computed value of N, and solely 
rely on N, for guidance on how to command the state vector to the desired values. 


Reference Inputs with Estimators: The 
State-Command Structure 


The same ideas can be applied to the case where the estimator is used to supply 
a state estimate for control in place of the actual state feedback. However, it is 
important to structure the system so that the control u that is applied to the plant 
is also applied to the estimator as shown in Fig. 8.16. This means that Eqs. (8.56) 
and (8.57) should not be used in the estimator box in Fig. 8.16 because they were 
based on control feedback that did not include the reference input. The basic 
idea of the estimator that resulted in the structure of Fig. 8.4 is to drive the plant 
model in the estimator with the same inputs that are applied to the actual plant, 


Peat 


(a) 


(b) 
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thus minimizing estimation errors. Therefore, the form of the estimator given by 
Eq. (8.23) should be used with u(k) as shown by Fig. 8.16, that is 


u(k) = —K(x(k) —x,) + N r = —Kx(k) + Nr, (8.75) 
or, with the current estimator, Eq. (8.33) is used with 
u(k) = -K(&(k) — xj) + N,r = -K&(K) + Nr, (8.76) 


or, for the reduced-order estimator, Eq. (8.45) should be used with 


uk) --[K, K,] | | p | -x, | +N,r=-[K, K,] | z | + Nr. 
° (8.77) 

Under ideal conditions where the model in the estimator is perfect and the 
input u applied to plant and estimator is identical, no estimator error will be 
excited. We use the feedback to the estimator through y only to correct for 
imperfections in the estimator model, input scale factor errors, and unknown 
plant disturbances. 

To analyze the response of a system, we must combine the estimator equations 
with the model of the system to be controlled. It is often useful to analyze the 
effect of disturbances, so the system equations (8.3) are augmented to include the 
disturbance, w, as 


x(k +1) = x(k) + Pu(k) +P w(k), 
y(k) = Hx(k) + Ju(k). (8.78) 


For the predictor estimator, Eq. (8.62) is augmented with the input command 
and disturbance as follows: 


x o -TK x FN r 
[alelan e-ren] llen eL pe 


(8.79) 
Note that the term on the right with r introduces the command input in an 
identical way to both the plant and estimator equations. The term on the right 
with w introduces the disturbance into the plant only; the estimator is unaware 
of it. 
It may be useful to inspect the performance of the system in terms of the de- 
sired output y,, the control u, and the estimator error X. This can be accomplished 
with the output equation 


y, (k) H, 0 
oe. || == | Or uate It) sg Im (8.80) 
X(k) B cr» 
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+ Example 8.12 


For the current estimator, the system eauations are found by combining 
Eqs. (8.57, (8.76), and (8.78), which yields 


x = o -FK x 
$ |, LLH® ®-TK-L Be || z | 
I'N 
«| IN 70+ l "un Jew. (8.81) 


Reference Input Command to Satellite Attitude 


Determine the state-command structure for the system whose regulator was found in Example 
8.8 and verify that its step response does not excite an estimator error. 


Solution. Evaluation of Eq. (8.73) yields N, = (1 0)” and N, = 0. Therefore, N=K m 
The desired response of the system is obtained from Eqs. (8.79) and (8.80). Using step in 
MATLAB yields the unit step responses shown in Fig. 8.17. Note that the estimator error 
remains zero; thus the response is exactly the same as if no estimator were present. Note also 
that the structure shown in Fig. 8.16(b) does not allow us to represent the system in a simple, 
classical manner with the D, (2) from Example 8.8 placed in the upper path as in Fig. 7.5, nor 
does it allow us to place D, (2) in the lower feedback path. Rather, it is best to stay with the 
state-space description and enter the equations in the control computer based on Fig. 8.16. The 
response in Fig. 8.17 would also have resulted if using no estimator, a current estimator, or a 
reduced-order estimator. 


Figure 8.17 
Step-response time 
histories for Example 
8.12 
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Outputs 
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8.4.3 


Figure 8.18 
Reference input as an 


output-error command 
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It is worthwhile to reflect on the fact that the combined system has poles that 
consist of the control and the estimator poles, as given by Eq. (8.55). The fact 
that the system response to an input structured as in Fig. 8.16(b) did not excite 
the estimator response means that the transfer function of this system had zeros 
that canceled the estimator poles. The determination of the structure in which 
the reference command is entered into a system can be viewed as one of “zero 
placement” and, in fact, it has been shown that it is possible to place the zeros of 
the closed loop transfer function at any arbitrary location [((Emami-Naeini and 
Franklin (1982)]. 


Output Error Command 


Another approach to the reference-input structure is to introduce the command 
only at the measured estimator input, as shown in Fig. 8.18. This solution is 
sometimes forced on the control designer because the sensor measures only the 
error. For example, many thermostats have an output that is the difference between 
the temperature to be controlled and the reference, or set-point, temperature. No 
absolute indication of the reference temperature is available to the controller. 
Likewise, some radar tracking systems have a reading that is proportional to the 
pointing error, and this signal alone must be used for control. In this case, the 
estimator equation (8.56) becomes 


x(k + 1) = (6 -TK — L,H)x+ L (y =r), 
and the system response can be determined by solving 


© -TK 0 
H =[ cn E +f Jo+[ 5 ] ow. 
k+l p P k p 
(8.82) 


re ee S in Output error 
Compensator, D(z} 
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Note that the command input r only enters the estimator; therefore, the plant and 
estimator do not see the same command and an estimator error will be excited. 


€ Example 8.13 


Output Command Structure with a Predictor Estimator 


Analyze the performance of Example 8.12 when using the output-command structure, specif- 
ically looking at the step response of the estimator error. 


Solution. The system is analyzed using Eq. (8.82) and the output Eq. (8.80). The result is 
shown in Fig. 8.19. Note the estimator error response and that it causes a substantial increase 
in overshoot as compared to that of Fig. 8.17 (about 70% rather than 20%) and an increased 
use of control. Although this degradation could be reduced with faster estimator-error roots, 
there are limits due to the adverse effect on the estimator's sensitivity to measurement noise. 
Some of the degradation can be reduced by using a current or reduced order estimator because 
of their quicker response from the immediate use of the measured signal. 


-+ 


€ Example 8.14 


Figure 8.19 

Output time histories for 
prediction estimator 
with output command, 
Example 8.13 


Output Command Structure with a Reduced-Order Estimator 


Analyze the performance of Example 8.13 when using the output-command structure and a 
reduced-order estimator. 


Outputs 


VS 
Time (sec) 


Figure 8.20 


Output time histories for 
reduced-order estimator 
with output command, 


Example 8.14 


8.4.4 
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Outputs 


Time (sec) 


Solution. The easiest way to analyze this system is by transform methods. The plant transfer 
function was analyzed in Section 4.3.1 and found to be 


T? (2+1) 

pRCEEUA 

and the transfer function that arose from the reduced-order estimator is given by Eq. (8.67). 
The output time histories using these transfer functions are shown in Fig. 8.20. Note again that 
there is more overshoot (about 50%) than shown in Fig. 8.17 but not as much as the prediction 
estimator. The control usage for this case is also less than in Fig. 8.19, and it starts at the 
first cycle rather than being delayed by one cycle, as was the case in Fig. 8.19. This result 
produced a settling time for the reduced-order case that is about one sample period faster than 
the prediction estimator. A current estimator would produce results similar to this case because 
it, too, shares the feature of an immediate response to the measured output. 


G(z) = 


* 


A Comparison of the Estimator Structure and 
Classical Methods 


This section has demonstrated a key result: The controller structure based on 
state-space design of the control and estimator exposes a methodology for the 
introduction of the reference input in a way that produces a better response 
than that typically used with transfer-function design. The state-command input 
shown in Fig. 8.16 provides a faster response with less control usage than the 


320 Chapter8 Design Using State-Space Methods 


€ Example 8.15 


output-error-command scheme shown in Fig. 8.18, which is the typical transfer 
function structure. The reason for the advantage is that the state-command struc- 
ture provides an immediate input to the plant and does not excite any modes of the 
compensator that degrade the response. Although this result has been specifically 
shown only for a simple second-order system, it applies to more complicated 
systems as well. In fact, for higher-order plants, higher-order compensators are 
often required with more modes for potential excitation. 

Itis not mandatory that the state-space representation and the state-estimator 
design approach be used in order to determine a structure that does not excite 
compensator modes. However, the determination is difficult using the transfer- 
function approach, especially in the MIMO case. 

The advantages of using the transfer-function representation are that high- 
order systems are easier to trouble shoot, the designs are made robust with less 
effort, and experimental frequency-response data can be used to close a loop 
quickly without a time-consuming modeling effort. These advantages might not 
always warrant transforming to the state-space form in order to achieve easily 
the better reference input structure. 

Although not discussed in this section using linear systems, the state- 
command structure allows for superior response of systems with nonlinear control 
characteristics, for example, saturation or on- off actuators. Whatever nonlinear- 
ity is present in the plant can usually be modeled to some degree in the estimator 
as well, thus reducing errors that would otherwise be excited by the nonlinearity. 


Compensation Design for a System with a Resonance 


A computer disk drive has a control system that commands the read head to move to specific 
tracks on the disk. One such system is described in Chapter 14. Although the dynamics 
between the torquer and the head motion are primarily G(s) — 1/s?, there are typically several 
resonances due to the arm's flexibility that limit the performance of the servo system. Here we 
wish to limit the flexibility to one resonance mode for simplicity. The transfer function is the 
same as the mass-spring system shown in Appendix A.4, although the derivation of it would 
involve rotational dynamics rather than the linear dynamics in Appendix A.4. (See Franklin, 
Powell, and Emami-Naeini (2019), Chapter 2. for more details.) 
The system transfer function is 


1 x 108 
s*(s? + fw s+ a») 
where the resonance frequency w, = | kHz and the damping ¢, = 0.05. Use a sample rate of 6 


kHz and design control systems that have a rise time of 10 msec with an overshoot less than 
15%. 


G(s) = 


(a) Do the design using a state estimator and the state command structure, 


(b) evaluate the K and L from (a) using the output error structure whether or not they meet 
the specifications, and 


Figure 8.21 
Disk read head response, 
Example 8.15 
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(c) doa classical design with a lead compensator. 


Iterate on whatever design parameters are appropriate for (a) and (c) to meet the design 


specifications. 


Solution. Equation (2.16) indicates that a t, < 10 msec would be met if w, > 180 rad/sec 
and the M, < 1596 would be met if ¢ > 0.5. 


(a) Forthe state space design, a good starting point would be with two of the desired poles at 


w, = 200 rad/sec and ¢ = 0.6. Two more poles also need to be selected for this 4'^ order 
system; so let's pick them at w = 1 kHz and ġ = 0.06. As discussed in Section 2.6, it is 
not wise to move poles any more than you have to. In this case, we have retained the same 
natural frequency and increased the damping slightly. For the estimator, a good starting 
point is to pick the natural frequencies twice those of the controller, that is, at 400 rad/sec 
and 2 kHz. The damping was selected to be 0.7 for both sets of poles in the estimator. 
The selected poles were converted to their discrete equivalents and used with acker to 
find K and L. Evaluation of the response for the state command structure is found using 
Eq. (8.79) in step and is shown in Fig. 8.21. We see that the rise time and overshoot 
specifications were met for this case and that the system settles to its final value at about 
35 msec. 


(b) Using Eq. (8.82) shows the response of the system with the same K and L for the output 


Output 


error command case and it is shown in the figure also. Note that the estimator excitation 
for this case caused an overshoot of 130% rather than the desired 15% and the system 
appears to be lightly damped even though its poles are in precisely the same place as 
case (a). It is interesting that the large overshoot was, in part, caused by using desired 
poles that increased the resonant mode damping from ¢ = 0.05 to 0.06. Had the damping 


2.5 


|O———— ŝtate command structure 
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been kept at 0.05, the overshoot would have been about 50% which is still excessive but 
significantly less. This illustrates the sensitivity of the system to the pole placement when 
using the output error command structure. 

(c) For the classical design, we note that the sample rate of 6 kHz is over 100 times faster 
than the desired closed loop natural frequency (200 rad/sec = 30 Hz). Therefore, the most 
expedient design method is to use the s-plane and convert the result to the discrete case 
when through. Furthermore, since the resonance frequency is significantly faster than 
the desired bandwidth (= 200 rad/sec), we can ignore the resonance for the first cut at 
the design and simply find a compensation for a l/s? system. Using frequency response 
ideas, we know that a lead compensation with a ratio of 25 between the zero and pole 
will yield a maximum increase in phase of about 60° (Fig. 2.17). We also know that a 
60° PM (Section 2.4.4) will translate to a damping ratio of about 0.6 which will meet 
the overshoot specification. For a 1/s? plant, the phase is 180° everywhere; therefore, the 
desired PM will be obtained if we place the lead compensation so that the maximum 
phase lead is at the desired crossover point (= 200 rad/sec). This is accomplished by 
placing the zero a factor of 5 below 200 (at 40 rad/sec) and the pole a factor of 5 above 
200 (at 1000 rad/sec), thus producing 

s+40 

s + 1000° 

Using this with the G(s) above in a Bode plot shows that the resonant mode does not 

affect the design significantly, the PM is met, and the desired crossover is achieved when 

K = 8000. To convert to the digital form, we invoke c2d using the matched pole-zero 

approach and find that 


D(s) = K 


z + 0.9934 

z + 0.8465 

The closed loop step response with D(z) in the forward path is found using step and 
shown in Fig. 8.21. It also meets the specifications, but a slow compensator mode was 
excited and the settling time of this system is considerably longer than the state command 
structure. The advantage of this approach is that the compensator is first order while the 
estimator approach (a) required a 4" order compensation. 


D(z) = 7394 


8.5 


* 


Integral Control and Disturbance Estimation 


Integral control is useful in eliminating the steady-state errors due to constant 
disturbances or reference input commands. Furthermore, most actual control 
systems are nonlinear and the input gain T and the state matrix ® vary with 
time and/or the set point. The linear analysis which is the subject of this book 
pertains to perturbations about a set point of the nonlinear plant and the control u 
is a perturbation from a nominal value. The use of integral control eliminates the 
need to catalog nominal values or to reset the control. Rather, the integral term 
can be thought of as constantly calculating the value of the control required at the 


8.5.1 


Figure 8.22 

Block diagram for 
integral control with 
full-state feedback 


8.5 Integral Control and Disturbance Estimation 323 


set point to cause the error to go to zero. For these reasons, some form of integral 
control is typically included in most control systems. More generally, the external 
signals frequently include persistent deterministic components and the control 
engineer is required to design a controller which will force the steady-state error 
to be zero in the presence of such signals. A particular case is that of the disk drive 
servo required to follow a data track that is slightly off center so the reference 
signal is sinusoidal. 

In the state-space design methods discussed so far, no mention has been made 
of integral control; nor have any of the design examples produced a compensation 
with an integral kind of behavior. In fact, state-space designs will not produce an 
integral action unless special steps are taken. There are two basic methods to force 
zero steady-state error in the presence of persistent signals: state augmentation 
also called internal signal model control and disturbance estimation. The idea 
of state augmentation for constant commands or disturbances was discussed for 
continuous systems in Section 2.6.5 and is essentially the same as the addition 
of an integral term that was discussed for transform design in Section 2.2.3. The 
generalization augments the state in a way that achieves zero steady-state error 
for a general class of reference and disturbance signals. Disturbance estimation 
provides the same effect based on estimation of the state of a model which could 
generate the external signals. We begin with the heuristic introduction of integral 
control. 


Integral Control by State Augmentation 


The idea is to add an integrator so as to obtain an integral of the error signal. This 
integrator will be physically implemented as part of the controller equations. We 
then feed back that integral along with the estimated or measured state as shown 
in Figs. 8.22 and 8.23 in a similar manner as before. To accomplish the design of 
the feedback gains for both the integral and the original state vector, we augment 
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Figure 8.23 

Block diagram for 
integral control with 
state estimation 


the model of the plant with an integrator, thus adding an error integral output to 
the existing plant state output. This augmented model is then used as before to 
calculate the feedback control gains for the augmented state. More specifically, 
to the standard system 


x(k + 1) = ®x(k) + Tu(k) +F w(k), 
y(k) = Hx(&), 


we augment the state with x,, the integral of the error, e = y — r. The discrete 
integral is simply a summation of all past values of e(k) (Eq. 3.15), which results 
in the difference equation 


x(k +1) =x,(k) + e(k) = x,(k) + Hx() — r (k), (8.83) 


therefore arriving at the augmented plant model 


x(k-)]| [1H x, (k) 0 am 
E oa E else «p ]«o- |, ]ie (8.84) 


The control law, following Eq. (8.75), is 


u(k) 2-[K, K] l UE l + KN r(k). 


With this revised definition of the system, the design techniques already developed 
can be used directly for the control law design. Following Figs. 8.15(a) and 
8.16(a), it would be implemented as shown in Fig. 8.22 for the full-state feedback 
case and as shown in Fig. 8.23 for the case where an estimator is used to provide 
X. The integral is replacing the feedforward term, N, and has the additional role 
of eliminating errors due to w. 

The estimator is based on the unaugmented model and is used to reconstruct 
the unaugmented state. It will be nth-order, where n is the order of the original 
system and requires the placement of n poles. On the other hand, the design of 


integrator pole 
cancellation 
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[K, K] requires the augmented system matrices (Eq. 8.84); therefore, there will 
be n + | poles to be selected for this portion of the design. 

If implemented as in Fig. 8.23, the addition of the extra pole for the integrator 
state element will typically lead to a deteriorated command input response com- 
pared to that obtained without integral control. While it is possible to iterate on 
the n + 1 selected control poles until a satisfactory response is obtained, it is also 
possible to retain the reference response obtained from a non-integral-control 
design by placing a zero in the controller as shown in Fig. 8.24 so that it cancels 
the extra pole from the integrator. Note that the feedforward N, term in Fig. 8.22 
has been replaced by N which introduces a zero at z,=1- bi 'S Using the zero 
to cancel the closed-loop pole that was added for the integral state element cancels 
the excitation of that pole by command inputs. Note that this does not cancel the 
integral action, it merely eliminates the excitation of the extra root by command 
inputs. A change in the disturbance, w, will also excite the integral dynamics and 
the steady-state errors due to either constant disturbances or constant command 
inputs are eliminated. As always, the integrator output changes until its input, 
which is constructed to be the system error, is zero. The configuration of Fig. 
8.24 can be changed to replace the feedforward of r to additional feedforward of 
e and modified feedback of y. 


€ Example 8.16 


Figure 8.24 


Integral Control for the Satellite Attitude Case 


Determine the integral control structure and gains for the satellite attitude control problem 
using full state feedback. Place the control poles at z = 0.8 + 0.25. 0.9 and use a sample 


Block diagram for integral control with full-state feedback and an added zero 


: K , , 
I5 1n fact, the system of Fig. 8.22 has a zero at 1 — z jb so the selection of the zero location corresponds 
to a particular selection of N. : 
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Figure 8.25 

Response of satellite 
example to a unit 
reference input att = 0 
and a step disturbance 
at t = 2 sec with no 
integral control action, 
Example 8.16 


period of T — 0.1 sec. Compute the time responses for a unit step in r at t = 0 sec and a 
step disturbance of 5 deg/sec? at t = 2 sec for (a) no integral control, (b) integral control as in 
Fig. 8.22, and (c) integral control as in Fig. 8.24 with the added zero at z = +0.9. 


Solution. 


(a) 


(b) 


(c) 


Outputs 


This case is the same controller as used in Example 8.12. The only difference is that there 
is a step in the disturbance at 2 sec. Therefore, Isim must be used in order to allow the 
multiple inputs, the step in r at £ = 0 and the disturbance step in w starting at 2 sec. The 
result is shown in Fig. 8.25. We see that the system responds identically to Fig. 8.17 for 
the first 2 sec, then there is a steady-state error in the output, x,, after the transients from 
the disturbance die out. The output error can be shown via the final value theorem to be 
w/K, = 0.5, thus the final value of x, is 1.5 instead of the commanded value of 1. 

A steady-state error resulting from a disturbance is a classic motivation for the addition 
of integral control. The system model from Example 8.1 is augmented according to 
Eq. (8.84) and used with acker to obtain the augmented feedback gain matrix, [K, K] = 
[1.005 13.74 4.313] by asking for control roots at z = 0.8 + j0.25, 0.9. We saw from 
Example 8.12 that N, — [1 0]; therefore, the system design is complete and can be 
implemented according to Fig. 8.22. Use of lsim produces the response in Fig. 8.26(a). 
Note that the desired result has been obtained in that there is no longer a steady-state error 
in the output, x,. However, also note that it has come with a price; the behavior before the 
disturbance step has been degraded. More control was used, and the initial overshoot has 
increased from the original 20% to about 40% because of the additional root at z = 0.9. 


The implementation is structured as shown in Fig. 8.24 which produces a zero at z — 0.9. 
All other parameters are the same as (b). Note that the resulting response in Fig. 8.26(b) 


Time (sec) 
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Figure 8.26 3 
Response of satellite 
attitude to a unit step 
input at t = O and a step 
disturbance at t = 2 sec 
with integral control, 
Example 8.16. (a) As in 
Fig. 8.22, (b) with an 
added zero as in 

Fig. 8.24 


Outputs 


mises w2 
QbEeeeeeeememHHmMHMMMMMHIHHMMMMeÜMe Ie Mee Pu 
e X, 
——. X2 
30 [ 2 3 4 5 6 


Time (sec) 


(a) 


Outputs 


Time (sec) 


(b) 


before 2 sec is now identical to the case in Example 8.12 with no integral control; yet, the 
integral action successfully eliminates the steady-state error. 


+ 
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8.5.2 


disturbance rejection 


disturbance modeling 


Figure 8.27 
Block diagram for input 
disturbance rejection 


It should be clear from the discussion and the example that the preferred 
implementation of integral control is given by Fig. 8.24 where the zero cancels 
the integrator closed-loop root. 


Disturbance Estimation 


An alternate approach to state augmentation is to estimate the disturbance signal 
in the estimator and then to use that estimate in the control law so as to force 
the error to zero as shown in Fig. 8.27. This is called disturbance rejection. 
This approach yields results that are equivalent to integral control when the 
disturbance is a constant. After the estimate, w, converges, the feedback of its 
value as shown in Fig. 8.27 will cancel the actual disturbance, w, and the system 
will behave in the steady state as if no disturbance were present. Therefore, the 
system will have no steady-state error, assuming, of course, that the steady-state 
error was due to a disturbance described by the assumed equation used in the 
estimator. It is important to notice that while a disturbance may, in general, appear 
at any point in the plant equations, the control can apply a signal to cancel it only 
at the control input. To reconcile these facts, we introduce the "input equivalent" 
disturbance. This is a virtual signal applied at the control input which would 
produce the same steady state output at y as the actual disturbance does. Then, 
when the control applies the negative of the virtual disturbance, the effect of the 
real disturbance at the output is cancelled and the error is driven to zero. To obtain 
an estimate of the virtual disturbance, we build the estimator with the equations 
of the virtual disturbance included. 

Disturbances other than constant biases can be modeled, included in the 
estimator equations, estimated along with the plant state, and their effect on 
errors eliminated in steady-state. If we assume the disturbance is a constant, the 
continuous model is quite simple: 


Pal. Hip Lp 
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A sinusoidal disturbance would have the model 
ü = —ww, 
or, in general, we could say that the disturbance obeys 
x = Ex, 
w =H,x,. 
and the discrete model is given by 
x(k + 1) = ®,x,(k) (8.85) 
w(k) = H,x,(k) (8.86) 


where ®, = e4" . For purposes of disturbance estimation, we augment the system 
model with the disturbance model, so Eqs. (8.85) and (8.86) become 


xk) ] [9 rH, ][ xd r 
Beal 0 "Es || oe (8.87) 
y=[H oi/* |. (8.88) 


which can be written as 


xl» |. x(k) 
| x, + 1) | -9, | x(k) ERZO 


x(k) 


In the particular case where the disturbance is a constant, these equations 


reduce to 
x(k +1) o^ r x(k) r 
mem NE : JE [e| 6 ew (8.89) 
»x2[H 0 I |. (8.90) 


All the ideas of state estimation in Section 8.2 still apply, and any of the 
estimation methods can be used to reconstruct the state consisting of x and x,, 
provided the system is observable. The computation of the required estimator 
gains is exactly as given in Section 8.2, the only change being that the system 
model is the augmented one given above by «b, and H. Note from Fig. 8.27, 
however, that the control gain matrix, K, is not obtained using the augmented 
model. Rather, it is obtained using the ® and T associated with the unaugmented 
F and G. In fact, the augmented system described by [®,,, H,,] will always be 
uncontrollable! We have no influence over the value of w by means of the control 


16 Observability requires that the virtual disturbance is "seen" at the plant ouput. Thus, if the disturbance 
is a constant, then the plant cannot have a zero from u to y at z = 1. 
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input, w, and must live with whatever value nature deals us; hence, the augmented 
system is uncontrollable. Our plan is not to control w, but to use the estimated 
value of w in a feedforward control scheme to eliminate its effect on steady-state 
errors. This basic idea works if w is a constant, a sinusoid, or any combination of 
functions that can be generated by a linear model. It works regardless of where 
the actual disturbance acts since the design is based on the virtual disturbance. 
The only constraint is that the disturbance state, x ,, be observable. 


€ Example 8.17 


Figure 8.28 

Response of satellite 
example to a unit 
reference input at t = 0 
and a step disturbance 
att = 2 sec with bias 
estimation as in 


Fig. 8.27, Example 8.17 


Bias Estimation and Rejection for the Satellite Attitude Case 


Determine the bias rejection control structure and gains for thesatellite attitude control problem. 
Place the control poles at z = 0.8 + j0.25 and use a sample period of T = 0.1 sec. Use a 
predictor estimator and place the estimator poles at z = 0.4 + j0.4, 0.9. Compute the time 
responses for a unit step in r at ! = 0 sec and a step disturbance of 5 Deg/sec at t = 2 sec and 
compare the results with the integral control in Example 8.16. 


Solution. For purposes of finding the control gain, we use the unaugmented model as in 
Example 8.1 and, therefore, find the same value of K — [10.25 3.49]. For purposes of 
designing the estimator, we augment the plant model according to Eqs. (8.89) and (8.90) and 
find that the desired poles yield L; =[1.3 6.14 5.2]. Structuring the control as in Fig. 8.27 
and applying the inputs as specified above, lsim yields the response as shown in Fig. 8.28. 
Note the similarity to Example 8.16 shown in Figs. 8.26(a) and (b). The disturbance rcjection 
approach also eliminates the steady-state error. But also note the early response to the reference 


Outputs 


Time (sec) 
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input: There is no increased overshoot as in Fig. 8.26(a); in fact, the response is identical to 
Fig. 8.26(b) up until the disturbance enters at ! = 2 sec. Note further that the disturbance 
estimate, w, approaches the actual disturbance value asymptotically. Notice that in this case 
the steady state error due to the reference input is made to be zero by the calculation of N and 
is not robust to small parameter changes in the way provided by integral control. 


* 


Example 8.17 shows that disturbance estimation can be used to estimate a 
constant disturbance input and then to use that estimate so as to reject the effect 
of the disturbance on steady-state errors. When the disturbance is a constant, this 
approach essentially duplicates the function of integral control. The following 
example shows how disturbance estimation can be used to estimate the value of 
the disturbance when it is a sinusoid. The estimate is used to cancel the effect of 
the disturbance, thus creating disturbance rejection. 


€ Example 8.18 


Disturbance Torque Rejection for a Spinning Satellite 


For a spinning satellite, a disturbance torque from solar pressure acts on the system as a 
sinusoid at the spin frequency. The attitude dynamics become 4th order as the two axes are 
now coupled; however, for slow spin rates, the dynamics may be approximated to be 1/5? as 
in Example 8.17. 

Determine the disturbance rejection control structure and gains for the attitude control 
with a disturbance torque from solar pressure of 2 deg/sec? where the satellite is spinning at 
15 rpm. Place the control poles at z = 0.8 + 0.25 and use a sample period of T = 0.1 sec, as 
before. Use a predictor estimator and place the estimator poles at z = 0.4 + j0.4,0.9+ j0.1. 
The location of the estimator poles corresponding to the disturbance can be selected at a 
relatively slow frequency as above if the sinusoidal disturbance is known to be relatively stable 
in magnitude and phase. By picking those estimator poles at a slow frequency, the disturbance 
estimate will not respond much to other higher frequency disturbances. 

Plot the time history of the disturbance, the estimate of the disturbance, and the system 
output to verify that, in steady-state, there is no error remaining from the disturbance. Put in a 
step command of I? at t = 5 sec to verify that the input will not excite any unwanted estimator 
errors. Examine the roots of the 8th order system and explain what each of them represent. 


Solution. The feedback for the unaugmented system is computed as in Example 8.17 to be 


K-[10.25 3.49]. 


The disturbance acts at the control input so there is no need for the concept of the virtual 
disturbance. It is modeled by choosing 


gei | Se j| C=" am Hg 0 | 
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in Eqs. (8.87) and (8.88). Use of acker with o, and H, and the desired poles results in 
L; = [1.375 6.807 8.391 — 6.374]. 


The time response of the system described by Fig. 8.27 is found by use of Isim where the 
state of the complete system consists of the augmented state as well as the estimate of the 
augmented state, an 8th order system. The feedback of w can be accomplished by using an 
augmented feedback gain K' = [K 1 0J. Figure 8.29 shows the results. Note in the figure 
that the estimate takes about 4 sec to converge to the correct value and that there is a noticeable 
error in the output due to the disturbance until that time. The step at 5 sec has no effect on the 
estimate quality and therefore the response to the step is precisely as it was originally designed. 
Without the disturbance rejection. there would have been a steady sinusoidal error of about 
0.2? superimposed on the output. 
The roots of the closed loop 8th order system are: 


z=08+0.25j, 04404j, 0.90.lj, 0.9881 0.156j. 


The first 6 roots represent those selected in the control and estimation design. The last two 
represent the discrete equivalent of the pure oscillation at 15 rpm, which are unchanged by the 
feedback. 


Figure 8.29 

Response of satellite to a 
sinusoidal input 
disturbance with 
disturbance rejection as 
in Fig. 8.27, Example 
8.18 


* 


The previous example had a sinusoidal disturbance torque acting on the input 
to the system. It is also possible to have a measurement or sensor error that is 
sinusoidal in nature that one would like to eliminate as a source of error to the 


Outputs 
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Time (sec) 


€ Example 8.19 


Figure 8.30 
Block diagram for sensor 
disturbance rejection 
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control system. The role of the estimator in this case is to use the contaminated 
measurement to reconstruct the error-free state for use in the control. The estimate 
of the sensor error is ignored by the controller. 


Satellite Attitude Control with Sinusoidal Sensor Disturbance Rejection 


When an attitude sensor on a spinning satellite is misaligned from a principal axis of inertia, 
the sensed attitude will have a sinusoidal component as the satellite naturally spins about its 
principal axis. Typically, it is desirable to spin about the principal axis; therefore, it is useful 
to estimate the magnitude and phase of the misalignment and to reject that component of the 
measurement. 

Repeat the design from Example 8.18, but replace the sinusoidal disturbance torque 
with a sinusoidal measurement error of 0.3° at 15 rpm. Again include a step command of 
1° at / = 5 sec. Place the control poles at z = 0.8 + j0.25, as in Example 8.18, and use a 
sample period of T = 0.1 sec. Use a predictor estimator and place the estimator poles at 
z — 0.8 + j0.2, 0.95 + j0.05. 


Solution. Asin Example8.18, the feedback for the unaugmented system is K — [10.25 3.49]. 
The output disturbance is modeled by augmenting the continuous plant with the matrices 


TETI SEE and H,=[1 0] 


and the augmented continuous system is 


x = Fx + Gu 
x, =F jx, (8.91) 
y=(HH,]. 


From these matrices, use of c2d will compute the discrete matrices ® and H, from which, 
with the desired poles the estimator gain is computed as 


L; = [0.3899 0.1624 0.0855 0.7378]. 


The time response of the system described by Fig. 8.30 is found by use of Isim where the 
state of the complete system consists of the augmented state as well as the estimate of the 


Estimator 
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reference following 


Figure 8.31 

Response of satellite to a 
sinusoidal sensor 
disturbance with 
disturbance rejection as 
in Fig. 8.30, Example 
8.19 


augmented state, an 8th order system. The estimate of v is ignored. Figure 8.31 shows the 
results. Note in the figure that the estimate takes about 3 sec to converge to the correct value 
and that there is a noticeable error in the output due to the measurement error until that time. 
The step at 5 sec has no effect on the estimate quality and therefore the response to the step 
is precisely as it was originally designed. Without the disturbance rejection, there would have 
been a steady sinusoidal error of about 0.2? as the controller attempted to follow the sinusoidal 
measurement. 


* 


A final example of the use of the estimator to achieve zero steady state 
error arises when it is desirable to track a reference signal with as little error as 
possible and it is known that the signal follows some persistent pattern. Since it 
would usually take some control effort to follow such a signal, the system would 
normally exhibit a following error of sufficient magnitude to produce the required 
control effort. This following error can be eliminated if the systematic pattern 
can be modeled and estimated, then used in a feedforward manner to produce the 
desired control effort. This is called reference following. The idea is the same as 
with disturbance rejection except that the error is now not the output only but the 
difference between the reference and the output. The idea again is to construct a 
virtual reference, p, at the control input which would produce the system error 
at the plant output, as shown in Figure 8.32(a). The feedback gain K is designed 


Outputs 


jenen x, + vi measured output, y' = 
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Figure 8.32 

Block diagram for sensor 
disturbance following. 
(a) The pretend model, 
and (b) the 


implementation model 
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using the unaugmented plant described by ® and I’, but the feedback signal is 
the system error e. 


By estimating o and feeding that estimate into the plant with the control as 


shown in Fig. 8.32(a), the effect of p is eliminated in this virtual model and e = 0 
in steady state. The actual situation that is implemented is shown in Fig. 8.32(b) 
where the reference is subtracted from the output to form the error, e, and the 


(a) 


Dyn Hos Lp 


(b) 
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estimate of the virtual reference, P, is subtracted from the control. Therefore, e 
will behave as if o was canceled, so that in steady state, e = 0, implying that y is 
following r. 


€ Example 8.20 


Sinusoidal Sensor Disturbance Following of a Disk Drive 


A computer disk drive read head must follow tracks on the disk. Typically, there is a small 
offset between the center of the tracks and the rotational center of the disk, thus producing a 
wobble in the tracks that the read head must follow. Assume the transfer function between the 
input command and the read head response is 


1000 
G(s) = = 


and the disk spins at 5000 rpm. Design a control system that follows tracks on the disk with 
no error even though they may be off center by some small amount. Pick the gains so that the 
system has a rise time better than 3 msec and no more than a 15% overshoot. Use a sample 
rate of 2 kHz. 


Solution. The disk head system is described by 


0 1 0 
Es [5 4|: =| o |: and’ A [3 9. ]. 


We pretend that the disturbance is the same as in Example 8.18, that is 


z-| 2, T and Hase [^ 0 | 


where w, = 5000 rpm or 523.6 rad/sec. The difference here compared to Example 8.18 is that 
the actual reference is being subtracted from the output and we wish to eliminate its effect on 
e. In other words, we want y to follow r in Fig. 8.32(b). 

Using F and G with c2d yields ® and F for the disk head dynamics. To meet the rise 
time and overshoot specifications, a few iterations show that poles with a natural frequency 
of 1000 rad/sec and a 0.6 damping ratio, converted to the digital domain, produce the desired 
response. The resulting feedback gain is 


K = [736.54 1.0865]. 
But we wish to include the virtual reference in the control as 
u = Kx — p. 


The estimator is constructed using the augmented system. A satisfactory design was found 
with equivalent s-plane poles that had natural frequencies of 1000 and 1200 rad/sec, both sets 
with 7 = 0.9. Use of acker with €», and H, results in LT = [3.9 10129 —2.4 553.0]. 

The time response of the system described by Fig. 8.32(b) is found by use of Isim where 
the state consists of the augmented state as well as the estimate of the augmented state, an 8* 
order system. 

The purpose of the feedforward is to provide a control that will produce y = r and thus 
e= 0. Since we have an estimate of an input, Ø, that will produce the track wobble which is 
the tracking reference, r, feedforward of —p causes y = —r. This produces the time response 


Figure 8.33 


Response of disk drive to 
sinusoidal reference 
following as in Fig. 8.32, 


Example 8.20 
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000000 : e, with command following 
: r, reference 


Outputs 


15 
Time (msec) 


marked by small circles in Fig. 8.33. Note that, once p has converged, there is no noticeable 
following error. 


8.6 


* 


Effect of Delays 


Many feedback control systems have a pure time delay, 4, imbedded in some 
part of the loop. A one cycle delay was analyzed in Section 7.3.2 by adding a z^! 
to the system model and using the root locus method to show that the stability 
of the system was decreased when no change to the compensation was made. 
We can also analyze the effect of delays with frequency response methods as in 
Section 7.4 by reducing the phase by wà. This analysis easily shows that, if no 
changes are made to the compensation, the phase margin and hence stability will 
be reduced. Either design method can be used to modify the compensation so the 
response is improved. 

For state-space design, we saw in Section 4.3.4 that, for an actuator delay, 
one state element must be added to the model for each cycle of delay or fraction 
thereof. In other words, a delay of 0 < A < T will require an increase in the order 
of the system model by 1, a delay of T < à < 2T will increase the order by 2, 
and so on. Using the pole placement approach, we can assign any desired pole 
locations to the system. Therefore, we are able to achieve the same closed-loop 
poles in a system with delays as one without delays; however, there are extra 
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8.6.1 


delay model 


Figure 8.34 
System with delays, 
(a) sensor delay, 

(b) actuator delay 


poles in the system with delays. Those extra poles can slow down the response 
even if they are selected as fast as possible, that is, at z = 0. The response of the 
system to command inputs and disturbances is also affected by the location of the 
delays, that is, whether they are part of the control actuator or the sensor. Figure 
8.34 shows systems with the delays in the two locations. 


Sensor Delays 


For the sensor delay, Fig. 8.34(a), an estimator can be used to reconstruct the 
entire state; therefore, the undelayed state is available for control. The system can 
be made to respond to command inputs, r, in exactly the same way that a system 
would respond without a sensor delay because the estimator sees the command 
through the feedforward of u and does not depend on the delayed output to detect 
it. Therefore, no estimator error is excited and the undelayed state estimate is 
accurate. On the other hand, a disturbance input, w, will not usually be seen by 
the estimator until the output, y,, responds; therefore, the estimator error will be 
increased by the delay. 


Estimator 


(b) Actuator delay 
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The model of a one cycle delay of a quantity y is 
yy (k + 1) = yk), (8.92) 


where y,, is the delayed version of y and is an additional state element that is 
added to the system model. The model for more than one cycle can be obtained 
by adding more similar equations and state elements. So, for two cycles of delay, 
we would add 


You (k + 1) = x, (&), (8.93) 


to Eq. (8.92), where y,, is one more state element and is the value of y that is 
delayed by two cycles. Therefore, for a system given by 


x(k 4-1) = x(k) + Pu(k) 
y(k) = Hx(&), 


the system model including a two-cycle sensor delay is 


x(k + 1) ® 0 0 x(k) r 
y,(--1) | =| HOO valk) | +] O | uk), (8.94) 
yy (k + 1) 0 10 y, (k) 0 
x(k) 
yer=(0 OL] | teh |. (8.95) 
Yogk) 


where y, is the output y delayed by two cycles. Any number of cycles of delay 
can be achieved easily by using this scheme. 

If a sensor had delay that was not an integer number of cycles, it would not 
influence the sampled value until the next sample instance. Therefore, sensor 
delays must be an integer number of samples. 

Note that, due to the column of zeros, the augmented system matrix, ®, 
in Eq. (8.94) will always be singular, a fact that will cause difficulties when 
calculating gains for a current estimator using acker in MATLAB." 


€ Example 8.21 


Effect of a Sensor Delay on the Satellite Attitude Response 


Examine the step response to a command input r of the satellite attitude control with one cycle 
of delay at the sensor. Place two of the control poles at z = 0.8 + 0.25, as was the case for 
Examples 8.2 and 8.12, and the additional pole for the delay state at z = 0. Place the poles for 
a prediction estimator at z = 0.4 + 0.4j, as was the case for Examples 8.5 and 8.12, and the 
additional pole for the delay state at z — 0. Compare the results with Example 8.12, the same 
System step response without a delay. 


17 We will see in Chapter 9 that a singular «D matrix will also cause difficulty with dlqr. 
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Solution. The state-space equations for this system are (from Eqs. 8.94 and 8.95) 


xk +1) L TD x, (k) 12 
x(k-1 |2|010 x(k) |+| T | ulk), 
yy (D i o9 0 yy (k) 0 
x (k) 
yk) =[0 0 1]! x% 
Vg) 


Use of acker in MATLAB with the augmented ® and I’ matrices as defined above with the 
desired control pole locations yields 


K —[10.25 3.4875 Oj. 


The first two elements are the exact same values that were obtained in Examples 8.2 and 8.12, 
and the zero third element means that the delayed state element is ignored. Thus the sensor 
delay has no effect on the controller. 

Use of acker in MATLAB with the augmented ® and H matrices as defined above with 
the desired estimator pole locations yields 


1:72: 
I| 5: 
1.2 


In evaluating the step response according to Eq. (8.79), N needs to be evaluated with the 
new values of NI and N,. In this case, Eq. (8.73) shows that 


Ni] js N 20, and N = 10.25. 


x u 


The use of step with the system defined by Eq. (8.79) produces the step response shown in 
Fig. 8.35. It includes the output y and the delayed output y,. Note that the response of y is 
precisely the same as the response for Example 8.12 in Fig. 8.17 even though the only quantity 
fed back was the delayed output. This result follows because the current value of the state was 
estimated correctly due to the feedforward from the command input and the current value of 
the state was used by the controller. 


* 


It is remarkable that a system with a sensor delay can respond to a command 
input in precisely the same way that the undelayed system would respond. This 
implies that the system closed-loop transfer function was such that the pole at 
z = 0 was not excited. The ability to design such a control system is enhanced 
by the state-space controller/estimator approach. It is important to note, however, 
that if the system had a disturbance, w, that was not fed into the estimator in the 
feedforward path, the delayed output measurement would cause a delayed esti- 
mator response and, thus, a degraded response of the system to that disturbance. 


Figure 8.35 

Step response for input 
command for Example 
8.21 


application to engine 
control 


8.6.2 


8.6 Effect of Delays 341 


Outputs 


S p N i ee 
0 0.5 l 1.5 2 2:8 3 
Time (sec) 


An example of a system in common use with a sensor delay is the fuel 
injection control for an automobile engine. Here, the sensed value of the fuel-air 
ratio in the exhaust is delayed by the piston motion and the time for the exhaust 
stream to reach the sensor. The primary disturbance is the motion of the throttle 
by the driver's foot; however, this motion can be sensed and used as feedforward 
to the estimator. Thus the estimator structure is capable of instantaneous rejection 
of the throttle disturbance in spite of the significant delay of the exhaust sensor. 
(See Fekete, 1995.) 


Actuator Delays 


The model for an actuator delay is derived in Section 4.3.4. The result is Eq. 
(4.79), which allows for any length delay, fractional or whole cycles. The poles 
of asystem with actuator delays can be placed arbitrarily, just like the sensor delay 
case, and it makes sense to add a pole at z — 0 for each added delay state element. 
However, for input commands or disturbance inputs, the extra poles at z = 0 will 
be excited and the system response will be delayed by the actuator delay. This 
fact can be understood by examining Fig. 8.34(b). There is no way that the input 
r can influence the plant without passing through the delay. A disturbance w will 
influence the plant without delay; however, the feedback through the estimator 
will be delayed before it can counteract the disturbance, thus causing an increased 
sensitivity to disturbances. 
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€ Example 8.22 


Effect of an Actuator Delay on an Engine Speed Governor 


An engine speed governor consists of an rpm sensor, usually a device that counts the time 
between the passage of teeth on a gear, and logic to determine the input to the fuel injector 
(diesel engines) or to the throttle actuator or spark advance (gasoline engines). Figure 8.36 
shows the block diagram including the engine model, which is a simple first order lag with 
a time constant of around 3 sec so that a — 0.3. Fig. 8.36(a) shows the system without any 
delay, the ideal case. The actuator delay shown in Fig. 8.36(b) varies from engine to engine. 
For a gasoline engine with a throttle actuator, it is typically around a half engine cycle due to 
the piston motion (100 msec at 600 rpm). For diesel engines the actuator can be made to act 
more quickly; however, some manufacturers use injectors that have a full engine cycle delay 
(200 msec at 600 rpm). For this example, we will use a sample time of T — 200 msec and a 
one cycle actuator delay. 

Investigate the response of the system for step input commands, r, and impulsive distur- 
bances, w. Compare the results for control systems structured as in the three cases in Fig. 8.36. 
Pick the control gain so that the poles are at z — 0.4 for the ideal case and the estimator gain 
(if there is one) so that its pole is at z = 0.2. Select Ñ so that there is no steady-state error. 


Solution. The step responses are shown in Fig. 8.37. For the ideal case (Fig. 8.36(a)), the 
gains were found to be 


K = 9.303 and N = 10.303. 


The step response for r = 1 is shown in Fig. 8.37(a) and was obtained using step. 
The model of the system with the delay in Fig. 8.36(b) can be accomplished with the aid 
of Eq. (4.79). For this system it reduces to 


xcd [pe TP x(k) 0 r 
[naso] [Le o llaw ltl: JoL Jeo. 
(k) 

()=[1 0 È ih 
y [ ] u,(k) 


where ® and I can be found using MATLAB’s C2d, but can also be computed easily for this 
first order case to be 


=e" and Tr21-e"T, 


Figure 8.37(b) shows the response of this system with the classical feedback as shown 
in Fig. 8.36(b) using the same gains as in (a). Not only is the response delayed by a cycle, 
but it has become oscillatory. A z-plane analysis of this system explains why: the roots are at 
z = 0.47 x: 0.57 j, which yield an equivalent s-plane damping of ¢ = 0.33. 

Figure 8.37(c) shows the response of the system with the estimator structure as shown in 
Fig. 8.36(c) using thc desired control pole locations of z — 0.4, 0. This yielded gains of 


K =[8.76 0.54] and N = 10.303. 


The estimator poles were placed at z = 0.2, 0, which yielded an estimator gain of 


0.742 
i "WP 


Figure 8.36 

Engine speed control 
block diagrams, (a) the 
ideal feedback system 
without a delay and 
without an estimator, 
(b) the classical feedback 
with an actuator delay, 
(c) the estimator 
structure with an 
actuator delay 
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(b) 


Sensor 


Estimator 


Delay + Plant 
Model 


(c) 


The response was obtained using Eq. (8.79) and is identical to the ideal case; however, it is 
delayed by one cycle. As previously discussed, this delay cannot be reduced if its source is 
due to the actuator. 

The oscillatory response shown in Fig. 8.37(b) also can be eliminated by reducing the gain 
in the classical structure of Fig. 8.36(b), an easier solution than adding an estimator. Reducing 
the gain, K, from 9.3 to 4.0 produced the response in Fig. 8.37(d). While the oscillations have 
been removed, the response has slowed down. There is no value of gain in this simple structure 
that will duplicate the quality of the response obtained from the estimator system, although the 
difference is not large for this example. 

Three of the cases were analyzed for an impulsive disturbance, w and shownin Fig. 8.38(a, 
b, and c). Qualitatively, the same kind of responses were obtained; however, the predictor 
estimator had an extra cycle of delay compared to the classical feedback system; therefore, 
there is little difference between the responses in Figs. 8.38(b) and 8.38(c). A current estimator 
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Figure 8.37 
Responses to a step 
input, r, for Example 
8.22. (a) Ideal case, no 
delay (K = 9.3), 

(b) classical feedback 
with delay (K = 9.3), 
(c) predictor estimator 
with delay, (d) classical 


Outputs 


feedback with delay (a) Time (sec) 
(K = 4.0) 
6 6 
(c) Time (sec) 
Figure 8.38 
Responses to impulsive 
disturbance, w, for 
Example 8.22. (a) Ideal 3 2 
case, no delay (K 29.3, $ El 
(b) classical feedback o z 
with delay (K = 4), 
(C) predictor estimator 
w/delay (K = 9.3), 
(d) current estimator 
w/delay (K — 9.3) 
g g 
& & 
ő 6 


(c) Time (sec) (d) Time (sec) 
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formulation eliminates the extra delay and improves the response, which was obtained by 
evaluating Eq. (8.81) and is shown in Fig. 8.38(d). 


8.7 


controllability definitions 


+ 


Example 8.22 shows that an actuator delay will effect the response to a 
command input or a disturbance regardless of the control implementation. This 
is an inherent characteristic of an actuator delay for any system. However, use of 
an estimator including the delay minimizes its effect. Use of a current estimator 
provides the best response to a disturbance. 


*Controllability and Observability 


Controllability and observability are properties that describe structural features 
of a dynamic system. ° These concepts were explicitly identified and studied by 
Kalman (1960) and Kalman, Ho, and Narendra (1961). We will discuss only a 
few of the known results for linear, constant systems that have one input and one 
output. 

We have encountered these concepts already in connection with design of 
control laws and estimator gains. We suggested in Section 8.1.3 that if the matrix 
C given by 


C - [T : or: ...:@"'T] 


is nonsingular, then by a transformation of the state we can convert the given 
description into the control canonical form and construct a control law such that 
the closed-loop characteristic equation can be given arbitrary (real) coefficients. 
We begin our discussion of controllability by making the definition (the first of 
three): 


I. The system (®, I) is controllable if for every nth-order polynomial o (z), 
there exists a control law u = —Kx such that the characteristic polynomial of 
® -TK is æ (z). 


And, from the results of Section 8.1.3, we have the test: 


The pair (®, I) is controllable if and only iftherankofC = [F : 4T : -.- ro 
is n. 


The idea of pole placement that is used above to define controllability is essentially 
a z-transform concept. A time-domain definition is the following: 


18 This section contains material that may be omitted without loss of continuity. 
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II. The system (®, I) is controllable if for every x, and x, there is a finite N and 
a sequence of controls u (0), u(1), - - -, 4(N) such that if the system has state Xo 
at k = 0, it is forced to state x, atk = N. 


In this definition we are considering the direct action of the control u on the state 
x and are not concerned explicitly with modes or characteristic equations. Let us 
develop a test for controllability for definition II. The system equations are 


x(k + 1) = x(k) + Fu(X), 
and, solving for a few steps, we find that if x(0) = x,, then 


x(1) = Px, + l'u(0), 
x(2) = ®x(1) + Fu(1) 
= dx, + ru (0) + Fu(1), 


: : N-1 
x(N) = "x, + 3 gU Pu(j) 


j=0 
u(N — 1) 
= "y + (PF: OP: -1^ 'T] 
u(0) 
If x(N) is to equal x,, then we must be able to solve the equations 
u(N — 1) 

MM u(N — 2) n 

[F: 9T: ... :9" T] =x, - "x 
u(0) 


We have assumed that the dimension of the state, and hence the number of rows 
of the coefficient matrix of these equations, is n; the number of columns is N. 
If N is less than n, we cannot possibly find a solution for every x,. If, on the 
other hand, N is greater than n, we will add a column ®"T, and so on. But, by 
the Cayley-Hamilton theorem, (see Appendix C), ®" is a linear combination of 
lower powers of ®, and the new columns add no new rank. Therefore we have a 
solution, and our system is controllable by definition II if and only if the rank of 
C isn, exactly the same condition as we found for pole assignment! 
Our final definition is closest to the structural character of controllability. 


III. The system (6b, I) is controllable if every mode in is connected to the 
control input. 


Because of the generality of modes, we will treat only the case of systems for 
which ® can be transformed to diagonal form. (The double-integrator model 
for the satellite does not qualify.) Suppose we have a diagonal ®, matrix and 
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Figure 8.39 

Block diagram for a 
system with a diagonal 
-matrix 


corresponding input matrix I’, with elements y,. Then the structure is as shown 
in Fig. 8.39. By the definition, the input must be connected to each mode so that 
no y, is zero. However, this is not enough if the roots À, are not distinct. Suppose, 
for instance, A, = A,. Then the equations in the first two states are 


wi(k+1)=A,w (k)+ yu, 
w, (k + 1) = Aw, (k) + yu. 
If we now define & = y,w, — y,w,, the equation in & is 
yw, (k + 1) — y w,(k + 1) 2 A y w, (k) — A y ws (k) - Y YM — Vj Yu 
which is the same as 
E(k + 1) =A,&(k). 


The point is that if two characteristic roots are equal in a diagonal ®, system with 
only one input, we effectively have a hidden mode that is not connected to the 
control, and the system is not controllable. Therefore, even in this simple case, 
we have two conditions for controllability: 


1. All characteristic values of ®, are distinct, and 
2. No element of F, is zero. 


Now let us consider the controllability matrix of this diagonal system. By 
direct computation, we obtain 


Yı Y uM 


Vor. vue £ ; 
Ya Y^. Ys 
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y, 0 0 | X m- 
0 y, l As 4i 
0 Y, L we TE 


The controllability matrix is a product of two terms, and C is nonsingular if and 
only if each factor is nonsingular. The first term has a determinant that is the 
product of the y,, and the second term is nonsingular if and only if the A, are 
distinct! So once again we find that our definition of controllability leads to the 
same test: The matrix C must be nonsingular. If C is nonsingular, then we can 
assign the system poles by state feedback, we can drive the state to any part of 
the space in finite time, and we know that every mode is connected to the input. 

As our final remark on the topic of controllability we present a test that is 
an alternative to testing the rank (or determinant) of C. This is the Rosenbrock- 
Hautus-Popov (RHP) test [see Rosenbrock (1970), Kailath (1979)]. The system 
(®, D) is controllable if the system of equations 


v[zI — 6:T] 2 0' 


has only the trivial solution v' = 0’, or, equivalently 


rank{zI — 6 :T] =n, 
or there is no nonzero v' such that 
() v zzv, (i) vr 20. 


This test is equivalent to the rank-of-C test. It is easy to show that if such a v 
exists, then C is singular. For if a nonzero v exists such that vT = 0 by (i), then, 
multiplying (i) by I on the right, we find 


ver —zvT =0. 
Then, multiplying by 4T, we find 
v®T - zv. eT - 0, 


and so on. Thus we derive v'C— 0' has a nontrivial solution, C is singular, and 
the system is not controllable. To show that a nontrivial v' exists if C is singular 
requires a bit more work and is omitted. See Kailath (1979). 

We have given two pictures of an uncontrollable system. Either the input 
is not connected to a dynamic part physically or else two parallel parts have 
identical characteristic roots. The engineer should be aware of the existence of a 
third simple situation, illustrated in Fig. 8.40. Here the problem is that the mode 
az= 1 appears to be connected to the input but is masked by the zero in the 
preceding member; the result is an uncontrollable system. First we will confirm 


19 Of course, we showed this only for ® that can be made diagonal. The result is true for general «b. 


Figure 8.40 

Block diagram of a 
simple uncontrollable 
system 


observability 
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this allegation by computing the determinant of the controllability matrix. The 


system matrices are 
+i 1 l 
o = a , r = ? 
ial 54 


c=(r en-[i |]. 


1 
2 


and 


which is clearly singular. If we compute the transfer function from u to x,, we 
find 


H(z) = H — (8.96) 
—lz-} 
= : a (8.97) 
z—1 


Because the natural mode at z — A disappears, it is not connected to the input. 
Finally, if we consider the RHP test, 


-1 -I 1 
[zI - o n-|* (Racks |]. 
2 


then we must test the rank of 


lEn 
A F9 


which is clearly less than two, which means, again, uncontrollable. In conclusion, 
we have three definitions of controllability: pole assignment, state reachability, 
and mode coupling to the input. The definitions are equivalent, and the tests for 
any of these properties are found in the rank of the controllability matrix or in 
the rank of the input system matrix [z1— I7. 

We have thus far discussed only controllability. The concept of observability 


andlet z = $, 
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observability definitions 


is parallel to that of controllability, and most of the results thus far discussed 
can be transferred to statements about observability by the simple expedient of 
substituting the transposes $^" for ® and H” for I. The result of these substi- 
tutions is a “dual” system. We have already seen an application of duality when 
we noticed that the conditions for the ability to select an observer gain L to give 
the state-error dynamics an arbitrary characteristic equation were that (7, H^) 
must be controllable—and we were able to use the same Ackermann formula for 
estimator gain that we used for control gain. The other properties that are dual to 
controllability are 


OI. The system (®, H) is observable if for any n th-order polynomial @,(z), 
there exists an estimator gain L such that the characteristic equation of the 
state error of the estimator is o, (z). 


OII. The system (®, H) is observable if for any x(0), there is a finite N such 
that x(0) can be computed from observation of y(0), y(1), ..., y(N — 1). 


OIII. The system (65, H) is observable if every dynamic mode in ® is connected 
to the output y via H. 


We will consider the development of a test for observability according to 
definition OII. The system is described by?? 


x(k + 1) = ®x(k), x(0) =X), 
y(k) = Hx(&); 


and successive outputs from k = 0 are 


y(0) = Hx,, 
y(1) = Hx(1) = H®x,, 
yQ) = Hx(2) = Héx(1) = Hd*x,, 


y(N — 1) = H®*"'>,. 


In matrix form, these equations are 


H 
y(0) Hj 
: = . Xo. 
y(N aa 1) Ho”! 


20 Clearly the input is irrelevant here if we assume that all values of u (k) are available in the computation 


of x,. If some inputs, such as a disturbance w, are not available, we have a very different problem. 
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As we saw in the discussion of state controllability, new rows in these 
equations cannot be independent of previous rows if N > n because of the 
Cayley-Hamilton theorem. Thus the test for observability is that the matrix 


H 
Ho 


Ho"! 


must be nonsingular. If we take the transpose of © and let H” = T and ®’ = 4, 
then we find the controllability matrix of (®, I), another manifestation of duality. 


8.8 Summary 


e For any controllable system (®,I) of order n, there exists a discrete full state 
feedback control law (K ) that will place the n closed-loop poles at arbitrary 
locations. acker.m or place.m using ®,F perform this function. 


e. C=|[I_®F...], the controllability matrix, must be of rank n, the order of 
the system, for the system to be controllable. ctrb.m performs this calculation. 


e The general rule in selecting the desired pole locations is to move existing 
open-loop poles as little as possible in order to meet the system specifications. 


e For any observable system (®,H ) of order n, there exists a discrete estimator 
with gain L that will place the n estimator error equation poles at arbitrary 
locations. acker.m or place.m using 9" ,H* calculates L. 


. O-z[H H®...], the observability matrix, must be of rank n, the order of 
the system, for the system to be observable. obsf.m performs this calculation. 


e Feedback via K using the estimated state elements results in system poles 
that consist of the n control poles plus the n estimator poles. 


e Estimator poles are usually selected to be approximately twice as fast as the 
controller poles in order for the response to be dominated by the control 
design. However, in order to smooth the effects of measurement noise, it is 
sometimes useful to select estimator poles as slow or slower than the control 
poles. 


e Calculation of N, and N, via refi.m and their usage with the structure in 
Figs. 8.15 or 8.16, the state command structure, provides the best response 
to command inputs. 


* Integral control is achieved by implementing the desired error integral and 
including this as part of an augmented plant model in calculating K. The 
estimator is based on the non-augmented model. Integral control eliminates 
steady state errors due to command inputs and input disturbances. 
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* Disturbance estimation is accomplished by augmenting the model used in 


the estimator to include the unknown disturbance as a state element. The 
disturbance could be an unknown constant or a sinusoid with unknown 
magnitude and phase. The disturbance estimate can then be used in the 
control law to reject its effect, called disturbance rejection or to cause the 
system to track the disturbance with no error, called disturbance following. 
Delays in the actuator or sensor can be modeled in the estimator so that no 
estimation errors occur. Sensor delays will cause a delay in the response 
to a disturbance but there need not be any delay in the response to com- 
mand inputs. However, actuator delays will cause a delayed response from 
command inputs and disturbances. 


8.9 Problems 


8.1 


8.2 


8.3 


8.4 


For the open-loop system 

ME eae 

u(s) s? +0.2s+1' 

(a) Find the discrete state-space representation assuming there isa ZOH and the sample 
period T = 0.5 sec. 


G(s) = 


(b) Find the full state digital feedback that provides equivalent s-plane poles at w, = 2 
rad/sec with £ = 0.5. 


(c) Examine the response of the closed-loop system to an initial value of y and verify 
that the response is consistent with the desired poles. 


Ed l 
ne zs| 3 
$-[5 1]. r«[il. 
compute K by hand so that the poles of the closed-loop system with full state feedback 


are at z = 0.9 t j0.1. 


For the open-loop system 


For the open-loop system 


l 
ow) = 9. Vl. 
u(s) s*(s + 2) 
(a) Find the discrete state-space model, assuming there is a ZOH and the sample rate 
is T = 100 msec. 


(b) Pick poles so that the settling time t, < 1 sec and find the K that will produce those 
poles with full state feedback. 


(c) Verify that z, is satisfied by plotting the response to an initial value of y. 


For the system in Problem 8.2, find the estimator equations and the value of the gain L 
by hand so that z,,. = 0.6 + j0.3 for 


(a) a predictor estimator, 
(b) acurrent estimator, and 
(c) areduced-order estimator (z am 0.6). 
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8.5 For the system in Problem 8.3, 

(a) Find the predictor estimator equations and the value of the gain L, so that the 
estimator t, < 0.5 sec. 

(b) Verify that t, is satisfied by plotting the response of x, (= y) to an initial value. 

8.6 For the open-loop system 
4 XGA. A 
Nc u(s) ~ 52 

preceded by a ZOH and a sample rate of 20 Hz, 

(a) Find the feedback gain K so that the control poles have an equivalent s-plane c, = 
10 rad/sec and ġ = 0.7. 

(b) Find the estimator gain L, so that the estimator poles have an equivalent s-plane 
w, = 20 rad/sec and ġ = 0.7. 

(c) Determine the discrete transfer function of the compensation. 

(d) Design a lead compensation using transform techniques so that the equivalent 
s-plane natural frequency w, = 10 rad/sec and ¢ = 0.7. Use either root locus or 
frequency response. 

(e) Compare the compensation transfer functions from (c) and (d) and discuss the 
differences. 

8.7 For the system in Problem 8.6, design the controller andestimator so that the closed-loop 
unit step response to a command input has a rise time t, < 200 msec and an overshoot 

M, <15% when using: 

(a) the state command structure, 

(b) the output error command structure. 

For both cases, check that the specifications are met by plotting the step response. 

8.8 For the open-loop system 
ys) l 
G(s) = —— = >=: 
sae s*(s* + 400) 
design the controller and estimator so that the closed-loop unit step response to a com- 
mand input has a rise time t, < 200 msec and an overshoot M, <15% when using 
the state command structure. Check that the specifications are met by plotting the step 
response. T= 30 ms. 
8.9 Compute G (z) from Eq. (4.64) for 
—a, —4à, —4, l 
@ = 1 0 0 . =|] 0 |, H=[b, b, b. 
0 l 0 0 
Why is this form for ® and F called control canonical form? 
8.10 (a) For 


e) eg] 


find a transform matrix T so that if x = Tw, then the equations in w will be in 
control canonical form. 
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8.11 


8.12 


8.13 


8.14 


(b) Compute K , the gain, such that if u = —K „w, the characteristic equation will be 
a (z) = z? — 1.6z + 0.7. 

(c) Use T from part (a) to compute K,, the gain in the x-states. 

(a) Show that the equations for the current estimator can be written in standard state 
form 

bkt = AS, + By, u = CE, + Dy, 

where $, = £X, -L y, A = (I- L,H)(® — TK), B = AL, C = —K, and 
D = -KL 

(b) Use the results of Eq. (4.65) to show that the controller based on a current estimator 
always has a zero at z = 0 for any choice of control law K or estimator law L.. 


For the open-loop system 


check the observability for: 

(a) H=[0 IJ. 

(b H=[1 0]. 

(c) Rationalize your results to (a) and (b), stating why the observability or lack of it 
occurred. 

Design the antenna in Appendix A.2 by state-variable pole assignment. 


(a) Write the equations in state form with x, = y and x, = y. Give the matrices F, G, 
and H. Let a = 0.1. 


(b) Let T = 1 and design K for equivalent poles at s = —1/2 + j (3/2). Plot the step 
response of the resulting design. 


(c) Design a prediction estimator with L, selected so that a, (z) = z^; that is, both 
poles are at the origin. 


(d) Use the estimated states for computing the control and introduce the reference input 
so as to leave the state estimate undisturbed. Plot the unit step response from this 
reference input and from a wind gust (step) disturbance that acts on the antenna just 
like the control force (but not on the estimator). 


(e) Plot the root locus of the closed-loop system with respect to the plant gain and 
mark the locations of the closed-loop poles. 


In Problem 7.8 we described an experiment in magnetic levitation described by the 
equations 
X = 1000x + 20u. 


Let the sampling time, T, be 10 msec. 


(a) Use pole placement to design this system to meet the specifications that settling 
time is less than 0.25 sec and overshoot to an initial offset in x is less than 20%. 


(b) Design a reduced-order estimator for x for this system such that the error-settling 
time will be less than 0.08 sec. 


(c) Plot step responses of x, x, and u for an initial x displacement. 
(d) Plot the root locus for changes in the plant gain and mark the design pole locations. 


(e) 
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Introduce a command reference with feedforward so that the estimate of x is not 
forced by r. Measure or compute the frequency response from r to system error 
r — x and give the highest frequency for which the error amplitude is less than 20% 
of the command amplitude. 


8.15 Derive Eq. (8.63) from Eq. (8.57). 
8.16 For the open-loop system 


(a) 


(b) 


(c) 


(d) 


(e) 


X)  y(z O.l185(z- 0.9669) 
u(z)  w(z) z'— 1.6718 z + 0.9048" 

Find the control feedback K and estimator gain L, that will place control poles at 
z = 0.8 + j0.2 and estimator poles at z = 0.6 + 70.3. 
Plot the response of y(k) for a unit step input (7) command using the state command 
structure. Would there be a steady-state error if N,,= 0? 
Determine what the steady state value of y(k) would be if there was an input 
disturbance, w. 
Determine an integral control gain and show the system block diagram with the 
integral control term included. Set the extra control pole at z — 0.9. 


Demonstrate that the system will have no steady state error for a disturbance, w, or 
an input command, u, even when N,= 0. 


8.17 For the open-loop system 


(a) 


(b) 
(c) 


(d) 


0.8815 0.4562 0.1185 
v= —0.4562 0.7903 I Lel, -| 0.4562 |: H= 10], 


Find the control feedback K and estimator gain L, that will place control poles at 
z = 0.6 + j0.3 and estimator poles at z = 0.3 + J0.3. 


Plot the response of y(k) for a unit step of the disturbance (w). 


Determine an integral control gain and show the system block diagram with the 
integral control term included. Set the extra control pole at z = 0.9. 


Demonstrate that the system will have no steady-state error for a disturbance, w. 


8.18 For the open-loop system from Problem 8.17 


(a) 


(b) 


(c) 


Assuming w takes on some unknown but constant value, construct an estimator that 
includes an estimate of that disturbance. Place poles of the system as in Problem 

8.17, except place the extra estimator pole at z = 0.9. Determine values of K and 
L and sketch the block diagram showing how the various quantities are used in 

the control. Include the command input r in the diagram using the state command 
structure. 


Plot the response of y and w to a unit step in w with r = O0. State whether the 
responses meet your expectations. 


Plot the response of y and w to a unit step in r with w = 0. State whether the 
responses meet your expectations. 


8.19 A disk drive read head has the open-loop transfer function 


ys) ys) 1000? 


G = = SS SSE 
(s) u(s) ws) ss T2250, s tol) 
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where w, = 6000 rad/sec and ¢, = 0.02. 


(a) Design a digital compensation so that there is no steady state error to an input 
command nor to a constant disturbance w. The plant gain of 1000 is not known 
precisely, so it is not acceptable to assume the steady state error due to input 
commands can be eliminated via feedforward with an N, term. The specifications 
are that the rise time 7, must be less than 2 msec and the overshoot M, « 10%. Use 
a sample rate of 3 kHz. 


(b) The disk spins at 3000 rpm. There is typically a small offset between the center of 
the circular tracks on the disk and the center of rotation, thus producing a wobble 
in the location of the tracks that should be followed by the read head. Determine 
the track following error due to this wobble. Express the answer as a percentage of 
the wobble magnitude. 


(c) Embellish your design from (a) so that the error due to the wobble is eliminated as 
best you can. Plot the frequency response of the tracking error (e in Fig. 8.32) where 
the input to the system is the track wobble. Mark the frequency that represents the 
spin rpm. 


8.20 A pendulum with a torquer at its hinge is described by 


8.21 


8.22 


BPLE UE. 
y-[ a[2 +e 


where x” = {0 6], 0 = angle of the pendulum from vertical, u = torque, w = torque 
bias, and b — measurement bias. Answer the questions below assuming the output is 
sampled with 7 — 100 msec and the control (u 4- w) is applied through a ZOH. 


(a) With no torque bias (w = 0), augment the system model so that the measurement 
bias is a state element. Is this system observable? 


(b) With no measurement bias (5 = 0), augment the system model so that the torque 
bias is a state element. Is this system observable? 


(c) Augment the system model so that both biases are state elements. Is this system 
observable? 


Design a controller (control plus estimator) for the same system as in Example 8.12, 
except add a delay of two sample periods between the system output and when the 
measurement is available to the computer. 


(a) Compute K for zi. = 0.8 + 0.25, 0, 0. 

(b) Is the system observable? Check for predictor and current estimators. 
(c) Compute L, with Zas = 0.4 + j0.4, 0, 0. 

(d) Compute L, with z,. as in part (c) and with Zy, = 0.4 + j0.4, 0.1, 0. 


(e) Plot the unit step response to an input command using the predictor estimator 
showing the plant output as well as the delayed sensor output. 


Determine the state-command input structure for a feedback system with the Type 0 plant 
10 

s’ +0.18s +9 

(a) Convert the system to discrete state-space form with T = 0.1 sec. 


G(s) = 


8.23 


8.24 
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(b) Find K to obtain equivalent s-plane control poles at § = 0.5 and w, = 4 rad/sec. 
(c) Find L, to obtain equivalent s-plane estimator poles at ¢ = 0.5 and w, = 8 rad/sec. 


(d) Determine N, and N,, then sketch a block diagram specifying the controller 
equations including the reference input. 


(e) Do you expect there will be a steady-state error for this system for a step input? 
(f) Plot the step response and confirm your answer to (e). 


Repeat the design of the controller (control and estimator) for Examples 8.4 and 8.8, but 
place all four desired poles at z = 0. (This is often referred to as a finite settling time or 
deadbeat design because the system will settle in a finite number of sample periods.) 


(a) Using your K and L „ determine the time response of the system to a unit-step 
input using the state-command input structure. 


(b) Determine the compensation in transfer function form, D, (2), and construct a root 
locus vs. the dc gain of D, (z). Mark the roots where the gain corresponds to the 
values computed for K and L,. 


The double mass-spring device described in Appendix A.4 is representative of many 
devices that have some structural resonances. Placing the sensor so that it measures 
y is called the colocated case, whereas placing it so that it measures d is called the 
noncolocated case. Often, the designer is not aware initially that a resonance exists in 
the system, a situation that is addressed by this problem. 

For M = 20kg,m = | kg, k = 25 N/m, and b = 0.2 N-sec/m, we obtain a resonance 
frequency of 5 rad/sec with a damping ratio, = 0.02. 


(a) Torepresent the case where the designer did not know about the resonance, assume 
the coupling is rigid, that is, k is infinite. The transfer function is then 
d l 
Gp OE E 
1 u(s)  u(s) (m+M)s 
Design a digital controller (K and L,) with 7 = 200 msec, control poles at 
z =0.75 + j0.2, and estimator poles at z = 0.3 + j0.3. Verify by simulation that 
it provides a response to a unit-step command using the state-command input 
structure that is consistent with the selected poles. 


(b) Use the controller (K and L ) obtained in part (a) in a simulation of the system 
where the infinite spring is replaced with the flexible one and the output is d, that 
is, a fourth-order plant with second-order controller. Examine the response and 
compare it qualitatively with an analysis of the closed-loop roots of this combined 
system. 


(c) Repeat part (b), but replace the plant output, d, with y. 


(d) Analyze where the roots of the system would be if you measured y and y directly 
(no estimator) and fed them back using your K from part (a). 


(e) Design a fourth-order controller with control poles at z = 0.75 + j0.2, 0.4 + j0.6, 
and estimator poles at z = 0.3 + j0.3, 0 + j0.4 with d as the measurement. Again, 
verify by simulation that it provides the correct response to a unit-step command 
using the state-command input structure. 


(f) Plot the frequency response of the compensation (control plus estimator) from part 
(e). State why you think this kind of compensation is usually referred to as a notch 
filter. 
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(g) Plot the z-plane root locus of the system (plant plus controller) and comment on 
the sensitivity of this design to changes in the overall loop gain. 
8.25 A heat exchanger? has the transfer function 
e^ 
UN E ED PU. 

where the delay is due to the sensor. 
(a) Write state equations for this system. 
(b) Compute the zero-order-hold model with a sample period of 5 sec. 


(c) Design the compensation including the command input with the control poles at 
0.8 + j0.25 and the estimator poles at 0.4 + j0.4, 0. 


(d) Compute the step response to a reference input step and to a disturbance input at 
the control. Verify that there is no delay for the command input. 


22 See Franklin, Powell, and Emami (2019), Example 2.18. 


A E 


Multivariable and Optimal Control 


A Perspective on Multivariable and Optimal Control 


The control-design procedures described in Chapters 7 and 8 were applied to 
systems with a single input and single output (SISO). The transfer-function 
approach in Chapter 7 is best suited to SISO systems; the state-space methods 
of Chapter 8 were limited to SISO in order to simplify the procedures. In fact, 
if we try to apply the pole-placement approach of Chapter 8 to a multivariable 
(multi-input, multi-output, or MIMO) system, we find that the gains, K or L, are 
not uniquely determined by the resulting equations. Therefore, a design approach 
is required which intelligently uses this extra freedom for MIMO systems. In 
addition, we saw in Example 8.3 that the selection of desired pole locations for 
SISO systems can be tricky business. Some sort of systematic guidance for the 
selection of control and estimator pole locations seems highly desirable. The 
material in this chapter provides a tool that meets both these needs. 

The subject of this chapter is the use of optimal control techniques as a tool 
in the design of control systems. It is important that the designer have no illusions 
that some true “optimal” design is being achieved; rather, the idea is to transfer 
the designer's iteration on pole locations as used in Chapter 8, or compensation 
parameters as used in Chapter 7, to iterations on elements in a cost function, 
J. The method will determine the control law that minimizes J, but because 
the parameters in 7 are arbitrarily selected, the design is at best only partially 
optimal. However, these designs will achieve some compromise between the use 
of control effort and the speed of response and will guarantee a stable system, no 
small feat. Therefore, each iteration on the parameters in 7 produces a candidate 
design that should be evaluated in the light of the design specifications. 


Chapter Overview 


The chapter starts out in Section 9.1 by discussing some of the steps that might be 
taken in order to convert a MIMO system into a SISO one. Although this cannot 
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9.1 


Figure 9.1 


always be done, it can help to clarify the key control issues in a complex system so 
that the later optimization of the whole system will be better understood. Section 
9.2 derives the time-varying optimal control solution that results directly from 
the optimal control problem statement. Section 9.3 shows how to find the steady- 
state value of the optimal feedback gain that is significantly easier to implement 
and is the one typically used in control implementations. Section 9.4 derives the 
companion optimal estimation problem. As for the control case, the time-varying 
gain solution is found first, then we find the steady-state gain case that is typically 
the one implemented. The final section, 9.5, shows how to use these results in the 
design of MIMO control systems. 


Decoupling 


The first step in any multivariable design should be an attempt either to find an 
approximate model consisting of two or more single input--output models or else 
to decouple the control gain matrix K and the estimator gain matrix L. This step 
will give better physical insight into the important feedback variables and can 
lead to a plant description that is substantially simpler for design purposes and yet 
yields no significant degradation from an analysis based on the full multivariable 
system. 

For example, the linearized equations of motion of an aircraft (Fig. 9.1) are 
of eighth order but are almost always separated into two fourth-order sets rep- 
resenting longitudinal motion (w, u, q) and lateral motion (p, r, v). The elevator 


Schematic of an aircraft showing variable definitions 


Elevator 6, 
Rudder ô, 


Velocity components: 

u 7 forward rate 

v 7 side-slip rate 
(lateral speed) 


w 7 altitude rate 
Angular velocity components: 
á A3 p = roll rate 
~> q = pitch rate 
r = yawrate 
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control surfaces affect longitudinal motion; the aileron and rudder primarily af- 
fect lateral motion. Although thereis a small amount of coupling of lateral motion 
into longitudinal motion, this is ignored with no serious consequences when the 
control, or “stability-augmentation,” systems are designed independently for the 
two fourth-order systems. 

Further decoupling of the equations is also possible. 


€ Example 9.1 


Control Decoupling 


Decouple the lateral aircraft fourth-order equations into two second-order equations and show 
how to design the control. 


Solution. The aircraft lateral equations are multivariable and of the form 


x(k + 1) = x(k) + Puls), (9.1) 
where 
v 
ô, r E 
wes $T. x b, , and p=$,. 
p 
A control law of the standard form 
v 
à, Ka Ky Ky Ky | i 
P nee 2 (9.2) 
| 8, | | Ka Ky Ky Ky, $, 
p 


shows that there are eight elements in the gain matrix to be selected, and the specification of 
four closed-loop roots clearly causes the problem to be underdetermined and will leave many 
possible values of K that will meet the specifications. This shows that the polc-placement 
approach to multivariable system design poses difficulties and additional criteria need to be 
introduced. 

A decoupling that removes the ambiguity is to restrict the control law to 


5 Ke K 0 0 r 
le |==| d ai E E] $, R (9.3) 


This makes good physical sense because the rudder primarily yaws the aircraft about a vertical 
axis (r-motion), thus directly causing sideslip (v), and the ailerons primarily roll the aircraft 
about an axis through the nose, thus causing changes in the roll angle, $,. and the roll rate, p. 
Given an achievable set of desired pole locations, there are unique values of the four nonzero 
components of K; however, the governing equations cannot be cast in the same form as in 
Eq. (8.5) and therefore can be difficult to solve. 
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A further decoupling that would permit an easy gain calculation is to assume that Eq. (9.1) 
is of the form 


li $; 6$, 0 0 T, 0 
r $4, % 0 0 r D. 0 | ô | 

=| "2 tm +] 2 r 9.4 
$, 0 0 $5, 34 $, 0 ry 8, k 
P Jey 0 0 n 6, P J, 0 T, 


and that the control law is given by Eq. (9.3). 

This makes some physical sense but ignores important coupling between the two aircraft 
modes. It does, however. decouple the system into second-order systems for which the methods 
of Chapter 8 can be applied directly to obtain the gains. The resulting closed-loop characteristic 
roots of the full lateral equations can be checked by calculating the eigenvalues of the closed 
loop matrix: (see eig.m in MATLAB) 


closed loop 
$n -njK, $n -njK, $,, - ';K,, $,, ~ T; Kj, 
$, — FaK bn — FaK $23 — r’,,K,, $4, — T5 K,, (9.5) 


$3, — FK, $5 — FK, $3, — DK, Oa, — Fy Ka : 
$4 — ai $, T FaK [m = I,K, 
which results from combining Eq. (9.3) and (9.4). 

If the plant coupling that was ignored in the gain computation is important, the roots 
obtained from Eq. (9.5) will differ from those used to compute the gains using Eqs. (9.3) and 
(9.4). In many cases, the method will be accurate enough and one need look no further. In 
other cases, one could revise the "desired" root locations and iterate until the correct roots 
from Eq. (9.5) are satisfactory or else turn to the methods of optimal control to be described in 
the following sections. 


$, =i 42 K,, 


* 


The same ideas apply equally well to the decoupling of the estimator into 
SISO parts. 


@ Example 9.2 


Estimator Decoupling 


Decouple the estimator for the inverted pendulum on a motorized cart (Fig. 9.2). 


Solution. The equations of motion can be written as 


X. =, | Be S X, E. 
| X, I" E | Poe $.. Jf X. | + | T, Jew. 


x = | x l cart position and velocity, 
€ 


(9.6) 


where 


| ; | stick angle and angular rate, 


Figure 9.2 
Hinged stick and 
motorized cart 
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Jc 


and the available measurements are 


[eline e o] 


The stick pictured in Fig. 9.2 is substantially lighter than the cart. This means that stick 
motion has a small dynamic effect on cart motion, which in tum implies that ¢,, = 0. This 
does not imply that ¢,. = 0; in fact, cart motion is the mechanism for influencing stick motion 
and hence stabilizing it. 

An estimator for the system described by Eq. (9.6) and Eq. (9.7) requires the determination 
of eight elements of an estimator gain matrix, L. Hence, specifying the four estimator roots 
and using the methods of Chapter 8 would not determine this L uniquely—another example 
of an underdetermined system caused by the multivariable nature of the problem. 

But because we can assume that $,, = 0, the cart equation in Eq. (9.6) uncouples from 
the stick equation, and we simply design an estimator for 


(9.7) 


DOK 


x(k +1) = aX (K) + Pu, (9.8) 
Y, 
=[]1 0 Ca le 
y=[ Ji y | 


which can be done with the methods described in Chapter 8. There is one-way coupling into 
the stick equation, but this just acts like an additional control input and can be ignored in 
the calculation of the stick estimator gain matrix, L., using the pole-placement methods of 
Chapter 8. However, the $,. coupling should not be ignored in the estimator equations, and 
there is no reason to ignore the weak coupling, ¢,,. The final (predictor) estimator would be 
of the form 


x(k + 1) 
x(k + 1) 


$.,8.(k) + Ø, X (k) + T; (E) + L, (lk) — 5,00). 


$,.%,(k) +Ø X (k) +F ulk) +L,@(k) — 60), 
where L, and L, are both 2 x 1 matrices. 

Even without the very weak one-way coupling in $.. that was obvious for this example, 
one could assume this to be the case, then check the resulting full-system characteristic roots 
using a method similar to the previous airplane example. Note that ignoring the coupling only 
causes approximations in the gain-matrix calculation and thus the root locations. There is no 
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approximation in the system model used in the estimator; thereforc, the estimation errors will 
still approach zero for stable estimator roots. 


cost function 


9.2 


+ 


In short, it is often useful to apply your knowledge of the physical aspects of 
the system at hand to break the design into simpler and more tractable subsets. 
With luck, the whole job can be finished this way. At worst, insight will be gained 
that will aid in the design procedures to follow and in the implementation and 
checkout of the control system. 


Time-Varying Optimal Control 


Optimal control methods are attractive because they handle MIMO systems easily 
and aid in the selection of the desired pole locations for SISO systems. They also 
allow the designer to determine many good candidate values of the feedback 
gain, K, using very efficient computation tools. We will develop the time-varying 
optimal control solution first and then reduce it to a steady-state solution in the 
following section. The result amounts to another method of computing K in the 
control law Eq. (8.5) 


u = —Kx (9.9) 


that was used in Chapter 8 and illustrated by Eq. (9.2) in Example 9.1. 
Given a discrete plant 


x(k + 1) = ®x(k) + Fuck), (9.10) 


we wish to pick u(x) so that a cost function 


1 N ! 
J=; 25 bs Qa + u^ (k)Q,u(k)] (9.11) 


is minimized. Q, and Q, are symmetric weighting matrices to be selected by the 
designer, who bases the choice on the relative importance of the various states and 
controls. Some weight will almost always be selected for the control (|Q,| = 0); 
otherwise the solution will include large components in the control gains, and 
the states would be driven to zero at a very fast rate, which could saturate the 
actuator device. The Q's must also be nonnegative definite,’ which is most 


1 If the sampling rate, T, is long, however, a control that moves the state along as rapidly as possible 
might be feasible. Such controls are called “dead-beat” because they beat the state to a dead stop in at 
most » steps. They correspond to placement of all poles at z = 0. See Problem 8.23. 

2 Matrix equivalent of a nonnegative number; it ensures that x7Q,x and u’ Q,u are nonnegative for all 
possible x and u. 


Lagrange multipliers 


adjoint equations 
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easily accomplished by picking the Q’s to be diagonal with all diagonal elements 
positive or zero. 

Another way of stating the problem given by Eqs. (9.10) and (9.11) is that 
we wish to minimize 


l N 
=5 Dr woe +u7(k)Q,u(k)] [9.11] 


subject to the constraint that 
—x(k + 1) + x(k) + Puck) 20, k =0,1,..., N. [9.10] 


This is a standard constrained-minima problem which can be solved using 
the method of Lagrange multipliers. There will be one Lagrange multiplier 
vector, which we will call A(k + 1), for each value of k. The procedure is to 
rewrite Eqs. (9.10) and (9.11) as 


ul l 
T=}. ECG + 5u  ()Q,uQ) 
k=0 
+ AT (k + 1)(—x(k + 1) + ®x(k) + Tu(k))], (9.12) 


and find the minimum of J’ with respect to x(k), u(k), and A(k). Note that for an 

optimal u(&) that obeys Eq. (9.10), the two cost functions, 7’ and J, are identical 

in magnitude. The index on A is arbitrary conceptually, but we let it be k + 1 

because this choice will yield a particularly easy form of the equations later on. 
Proceeding with the minimization leads to 


ð + 
a = u” (k)Q, +A’ (k+DFr=0, control equations, (9.13) 
ə 7 
nT = —x(k + 1) + x(k) + Puck) 20, state equations, and [9.10] 
0 7 
S = x' (k)Q, — A'(k) - A (k +1) —0, adjoint equations. (9.14) 


The last set of the equations, the adjoint equations, can be written as the back- 
ward difference equation 


A(k) = $'A(k + 1) + Q,x(&). (9.15) 
Restating the results in more convenient forms, we have from Eq. (9.10) 
x(k + 1) = x(k) + Tuck), (9.16) 
where, using Eq. (9.13) 
u(k) = -Q;' T"A(k + 1), (9.17) 
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sweep method 


and from Eq. (9.15) we can describe A(k + 1) in the forward difference equation 
form 

Alk + 1) = PACK) — OQ, x(k). (9.18) 
Equations (9.16), (9.17), and either (9.15) or (9.18) are a set of coupled difference 
equations defining the optimal solution of x(k), A(k), and u(k), provided the 
initial (or final) conditions are known. The initial conditions on x(k) must be 
given; however, usually A(0) would not be known, and we are led to the endpoint 
to establish a boundary condition for A. From Eq. (9.11) we see that u(N) 
should be zero in order to minimize J because u(/V) has no effect on x(V) [see 
Eq. (9.10)]. Thus Eq. (9.13) suggests that A(N + 1) = 0, and Eq. (9.14) thus 
shows that a suitable boundary condition is 


A(N) = Q,x(N). (9.19) 


A set of equations describing the solution to the optimal control problem is 
now completely specified. It consists of the two difference equations (9.16) and 
(9.15) with u given by Eq. (9.17), the final condition on A given by Eq. (9.19), and 
the initial condition on x would be given in the problem statement. The solution 
to this two-point boundary-value problem is not easy. 

One method, called the sweep method by Bryson and Ho (1975), is to 
assume 


A(k) = S(k)x(k). (9.20) 


This definition allows the transformation of the two-point boundary-value prob- 
lem in x and A to one in S with a single-point boundary condition. With the 
definition Eq. (9.20), the control Eq. (9.13) becomes 


Q,u(k) = -T'S(k + 1)x(k + 1) 
= -I"S(k + 1)(@x(k) + Pu(k)). 
Solving for u(k), we obtain 
u(k) = —(Q, + 'S(k + DD) T'S(k + 1) ®x(k) 
= —R™'T’S(k + 1)®x(k). (9.21) 
In Eq. (9.21) we have defined 
R = Q, + I"S(k + DP 


for convenience. If we now substitute Eq. (9.20) into Eq. (9.15) for A(k) and 
A(k + 1), we eliminate A. Then we substitute Eq. (9.21) into Eq. (9.16) to elimi- 
nate x(k + 1) as follows. From Eq. (9.15), we have 


A(k) = ®7A(k + 1) + Q,x(k), 
and substituting Eq. (9.20), we have 
S(k)x(k) = B7S(k + Dx(k + 1) + Q,x(k). 


Riccati equation 
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Now we use Eq. (9.16) for x(k + 1) 
S(k)x(k) = 9'S(k + 1)(@x(k) + Puck) + Q, x(k). 
Next we use Eq. (9.21) for u(K) in the above 
S(k)x(k) = P” S(k + 1)[®x(k) — PR^'T'S(k + 1)@x(k)] + Q, x(k), 
and collect all terms on one side 


[S(k) — B7S(k + 1) + O’S(k + DTR^T'S(k + 1) — Q,]x(k) = 0. 
(9.22) 
Because Eq. (9.22) must hold for any x(k), the coefficient matrix must be iden- 
tically zero, from which follows a backward difference equation describing the 
solution of S(k) 


S(k) = ® [S(k + 1) — S(k + DFPR^'T'S(& + 1)1®+Q,, (9.23) 
which is often rewritten as 
S(k) = 9" M(k + 1) 4- Q,. (9.24) 


where 
M(k + 1) = S(k + 1) - S(k + DTTG, + P" S(k + DET T'S(c + 1). (9.25) 


Equation (9.23) is called the discrete Riccati equation. It is not easy to solve 
because it is nonlinear in S. But note that the matrix to be inverted in Eq. (9.25) 
R, has the same dimension as the number of controls, which is usually less than 
the number of states. 

The boundary condition on the recursion relationship for S(k + 1) is obtained 
from Eq. (9.19) and Eq. (9.20); thus 


S(N) =Q,, (9.26) 


and we see now that the problem has been transformed so that the solution is 
described by the recursion relations Eq. (9.24) and Eq. (9.25) with the single 
boundary condition given by Eq. (9.26). The recursion equations must be sol ved 
backwards because the boundary condition is given at the endpoint. To solve for 
u(k), we use Eq. (9.21) to obtain 


u(k) = —K(k)x(k), (9.27) 
where 
K(k) = IQ, + I" S(k + DET 'T' S(k + 1) (9.28) 
and is the desired “optimal” time-varying feedback gain. 


Let us now summarize the entire procedure: 


1. Let S(N) = Q, and K(N) = 0. 
2. Letk =N: 
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. Let M(k) = S(k) — S(OTI[Q, + FP’ SAP] 'LPS(K). 
. Let K(k — 1) = [Q; + PSY PL S(k)®. 

. Store K(k — 1). 

Let S(k — 1) = 9^ M()ó + Q.. 

Letk 2 k—1. 

. Go to step 3. 


9 3090 t^ 2 ow 


For any given initia! condition for x, to apply the control, we use the stored gains 
K(k) and 


x(k + 1) = x(k) + Fu(X), [9.16] 
where 
u(k) = —K(k)x(k). [9.27] 


Note that the optimal gain, K(k), changes at each time step but can be pre- 
computed and stored for later use as long as the length, N, of the problem is 
known. This is so because no knowledge of the initial state x(0) is required for 
computation of the control gain K(k). 


Time-Varying Nature of Control Gains 


Solve for the time history of K for the satellite attitude-control example described in Appendix 
A.1. Choose the state weighting matrix to be 


g=] at (9.29) 


which means that the angle state is weighted but not the angular velocity. Choose the control 
weighting matrix, a scalar in this case because there is a single control input, to have three 
values 


Q,=0.01, 0.1, and 1.0, (9.30) 


and plot the resulting time histories of K. 


Solution. Equations (9.24) through (9.28) need to be solved for the system transfer function 
G(s) = 1/s?. The problem length for purposes of defining J was chosen to be 51 steps, which, 
with the sample period of T = 0.1 sec. means that the total time was 5.1 sec. This time was 
chosen long enough so that it was apparent that the gains were essentially constant over the 
initial time period. 

Figure 9.3 contains the resulting gain time histories plotted by the computer. We see from 
the figure that the problem length affects only the values of K near the end, and in fact, the 
first portions of all cases show constant values of the gains. If the problem length had been 


Figure 9.3 
Example of control gains 
versus time, Example 9.3 
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Control gain, Kg 


chosen to be longer, the end characteristics would have been identical, and the early constant 
gain portion would have existed for a longer time. 


* 


The fact that the gain over the first portion of the example was constant is 
typical of the optimal gains for all constant coefficient systems, provided that 
the problem time is long enough. This means that the optimal controller over 
the early, constant-gain portion is identical to the constant-gain cases discussed 
in Chapter 8 and Section 9.1 except that the values of the constant gain, K, are 
based on the minimization of a cost function rather than a computation based 
on specified root locations. We could also view this result as a method to find 
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a “good” set of pole locations because the constant, optimal K during the early 
portion of the solution determines a set of closed-loop roots. 

For MIMO problems, the time-varying gains act exactly as in the preceding 
example. The next section develops a method to compute the constant value of 
the optimal gains so that they can be used in place of the time-varying values, 
thus yielding a much simpler implementation. The only region where the gains 
are not optimal is during the transient region near the end. In fact, many control 
systems are turned on and left to run for very long times; for example, a satellite- 
attitude control system might run for years. This kind of problem is treated 
mathematically as if it runs for an infinite time, and therefore the constant-gain 
portion of the time-varying optimal solution és the true optimal. 

Before we leave the time-varying case it is informative to evaluate the optimal 
cost function 7 in terms of X and S. If we substitute Eqs. (9.13) and (9.14) for 
AT (k 4- 1)F and A" (k + 1)® in Eq. (9.12), we find 


N 
T = 5) 3X (0Q,x6) + u (0,00 — M (c + Dx + 1) 
k=0 
HATH) — xQ, x(k) + (-07Q,)U(&)] 


1 N 
752 4M xt) - A (k Dx(c + 0) 
k=0 
1 l 
= A! (0)x(0) — 3 AT(N  Dx(N + 1). 
However, from Eq. (9.19), A(N + 1) = 0, and thus, using Eq. (9.20), we find 


l 
T=J= ^ (Ox(0) 
= 5x" SOX). (9.31) 


Thus we see that having computed S, we can immediately evaluate the cost 
associated with the control. Although the cost could be used in evaluating dif- 
ferent candidate designs, in fact, it is not very useful because the weighting 
matrices, Q, and Q,, are arbitrary quantities that change with the different de- 
signs. Furthermore, the value of the discrete cost as defined by Eq. (9.11) is 
roughly proportional to the sample rate, thus eliminating any usefulness in eval- 
uating the performance for different sample rates. Typically a designer would 
evaluate different designs by looking at the traditional measures that have been 
discussed in Chapter 7 or possibly by evaluating quadratic performance mea- 
sures that are independent of the Q’s and the sample rate. We will show in 
Section 9.3.4 how to remove the effect of sample rate on the value of the 
cost. 


9.3 


regulator 


LQR 


algebraic Riccati equation 


Euler-Lagrange equations 
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LQR Steady-State Optimal Control 


The previous section developed the optimal control gain that minimized the cost 
in Eq. (9.11). We saw that the result was a time-varying gain, K(k), but that 
there would usually be a portion of the solution that produced a constant gain, 
Ko which would be much easier to implement in a control system. In fact, for 
the infinite time problem, called the regulator case, the constant-gain solution 
is the optimum. We call this solution the linear quadratic regulator, or LQR, 
because it applies to linear systems, the cost is quadratic, and it applies to the 
regulator case. This section will discuss how one finds the LQR solution and 
various properties of the solution. 

One obvious method to compute the value of K during the early, constant 
portion of a problem is to compute S backward in time until it reaches a steady 
value, $,,, then use Eq. (9.28) to compute K,,. This has been done in some 
software packages and gives reliable answers. Its drawback is that it requires 
substantially more computation than the alternate methods. 

Another method is to look for steady-state solutions of the Riccati equation. 
In steady state, S(k) becomes equal to S(k + 1) (we'll call them both S, ) and 
the Riccati Eq. (9.23) reduces to 


S. E: DiS, = S „IR 'T'S _]® 4 Q,, (9.32) 


which is usually referred to as the algebraic Riccati equation. Because of the 
quadratic appearance of S, , there is more than one solution, and one needs to 
know that S must be positive definite to select the correct one. The fact that S is 
positive definite follows by inspection of Eq. (9.31) and that J must be positive. 
For extremely simple problems, one is sometimes able to use Eq. (9.32) to find an 
analytical solution for S ,, but in most cases this is impossible, and a numerical 
solution is required. 

Most software packages use variations on a method called eigenvector de- 
composition due to its superior computational efficiency compared to the meth- 
ods above. It is based on the linear description of the combined state and adjoint 
equations given by Eqs. (9.16), (9.17), and (9.18), which describe the time-varying 
solution. We can combine these equations into a set of difference equations in 
standard form in x and A if we assume that Q, and ® are nonsingular? These 
equations are called Hamilton's equations or the Euler-Lagrange equations 


x $-rog;r'e'Q -ro;r'e" | | x | 
- 2 ! He 9.33 
| A E | ay "Q, p” À k i i 


3 For systems with a pure time delay that is greater than the sampling period, T, ® is singular and the 
following development would fail. Software packages usually have features in their formulations that 
circumvent this difficulty. 
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9.3.1 


and their system matrix is called the control Hamiltonian matrix 


[ +r E Q, -rQ T o | 
-$^Q, o i 


Because the system described by Eq. (9.33) is linear and the Hamiltonian matrix 
is constant, we can solve for the eigenvalues of Eq. (9.34) [or roots of Eq. (9.33)] 
using standard techniques (see eig.m in MATLAB). For an nth order system, there 
will be 2n eigenvalues. We will show in the next section that n of the roots are 
stable and the other n are unstable. In fact, the n unstable roots are the reciprocals 
of the n stable roots. Furthermore, the n stable roots are the roots of the optimal, 
constant-gain, closed-loop system! If we were trying to find the optimal K for a 
SISO system, the problem would now be complete, because knowing the optimal 
roots allows us to use Ackermann's formula to find K. But we wantthe optimal K 
for MIMO systems, too, so it's not that simple. We will return to the eigenvector 
decomposition solution after establishing the characteristics of the roots just 
stated. 


em 


€ 


(9.34) 


Reciprocal Root Properties 


Let us turn to the question of the reciprocal nature of the roots of Eq. (9.33). If 
we take the z-transforms of Eqs. (9.16), (9.17), and (9.15), we obtain 


zX(z) = ®X(z) + FU(z), (9.35) 
U(z) = —zQ; T' A(z), (9.36) 
A(z) = Q,X(z) + zb' A(z). (9.37) 


If we substitute Eq. (9.36) into Eq. (9.35) and write the remaining two equations 
in terms of the variables X(z) and zA(z), we find, in matrix form 


d-® rQ’ X(z) — 10) 
-Q, z1-9" eA) | ^ 


Thus the roots of the Hamiltonian system are those values of z for which 


d-9 rQ] 
aer | -Q, zI- 0’ =0. 


If we now reduce the term —Q, to zero by adding Q, (zI — $)’ times the first 
rows to the second rows, we have 


I- o FOT l 
det | * 2 =0. 
: [ 0 z'"I-9' 4 Q,GI - 4)" TQ; T7 


Because this matrix is blockwise triangular, we have 


det(zI — ®)det{z-'I — ^ + Q, zI- 9)" TQ;'T"] = 0. 


9.3.2 
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Now we factor the term z^'1 — «P from the second term to find 
det(zI — d)det((z^'I — &"){1 + (z^'E— 9^) 'Q, (I — ©) TQ;'T")] = 0. 


To simplify the notation, we note that det(zI — ®) = a(z), the plant characteristic 
polynomial, and det(z^'I — $^) = a(z ^). Thus, using the fact that det AB — 
det A det B, we find that the Hamiltonian characteristic equation is 


a(z)a(z~')det{I + p(z^'1 — ^)" HTHGI — 6)" TP } — 0. (9.38) 


where Q, — oH H and rQ;'r" = IT. Now we use the result (Eq. C.3) from 
Appendix C for the determinant of a sum of I and a matrix product AB, choosing 
A = (z 1 — ^)*!H" to write 


a(z)a(z ')det[1 + pH (zI — ©) TT zI — $^)-H7] - 0. (9.39) 


If we replace z by z^' in Eq. (9.39), the result is unchanged because detA” = 


detA. Therefore, if z, is a characteristic root of the optimal system, so is the 
reciprocal z; !, and the desired relationship has been established. 

These 2n roots are those of the coupled x, À system described by Eq. (9.33), 
which describes the solution of the time-varying gain case. But the time-varying 
gain solution includes the portion where the gains are constant. Furthermore, 
during the constant-gain portion, the system can be described by Eq. (9.16) with 


u(k) = —K, x(&), 


and the roots of this simplified nth order description must be n of the 2n roots 
of Eq. (9.33). But which ones are they? The answer must be the n stable ones, 
because if any unstable roots were included, the value of J would be approaching 
infinity and would be far from optimal. 

Therefore we see that once the roots of Eq. (9.33) are found, the n stable 
ones are the roots of the optimal constant-gain case. 


Symmetric Root Locus 


An interesting special case of Eq. (9.39) occurs for SISO systems. In this case, the 
cost function J can be written as J = oy? + u? where y = Hx and Q, was set 
equal to |. Therefore, T =T and we see that H(zI — 4) T is the plant transfer 
function G (z). Eq. (9.39) reduces to 


1 + oG(z ')G(z) = 0, (9.40) 


and is the characteristic equation for SISO optimal control. It is an equation in 
root-locus form with respect to p, the parameter that reflects the relative weighting 
on output error y and control u. If p is small, the optimal roots are near the poles 
of the plant (or the stable reflections of the poles if G(z) is unstable), and as o 
gets large, the roots go toward the zeros of G(z~')G(z), which are inside the unit 
circle. 
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Symmetric Root Locus for Satellite Attitude Control 


Draw the symmetric root locus for the satellite attitude-control problem in Example 4.11 for 
= ].4 sec. Comment on the damping of the optimal controller vs. p. 


Solution. The discrete transfer function from Example 4.11 is 


(z 1) 
G(z) = ———35. 
(z — 
Replacing all the z's with z^! and multiplying top and bottom by z? results in 
z(z + 1) 
G!) = ———- 
(z — 1)? 


Therefore, the locus of optimal root locations versus the parameter p is determined by substi- 
tuting the two G ’s into Eq. (9.40) to arrive at 

z(z + 1)? 

— = 0 

(z—1)* 

The locus is drawn in Fig. 9.4. Note how the stable locus segments start from the open-loop 
poles and terminate at the zeros. Also note that, for each stable root, there is an unstable one 
that is its reciprocal. 

The optimal damping ratio for any p is ¢ = 0.7. Designers have always known that 
picking ¢ = 0.7 produced a good compromise between speed of response, overshoot, and use 
of control; it also turns out, for this example, to be the optimal solution. This result makes 
sense because the optimal formulation is attempting to do the same thing that designers have 
always tried to do, that is, find the roots that achieve a good balance between the output error 
and the use of control. 


1+ p 


9.3.3 


* 


Figenvector Decomposition 


Now let us return to the optimal constant-gain solution for the general case. We 
can solve Eq. (9.33) by transforming to a new state that has a diagonal system 
matrix, and from this solution we can obtain the steady-state optimal control. 
Just as before, the eigenvalues of this matrix are such that the reciprocal of every 
eigenvalue is also an eigenvalue. Therefore, half the roots of the characteristic 
equation must be inside the unit circle and half must be outside. In this case, 
therefore, H, can be diagonalized to the form* 


"E I 
e: ae 


4 In rare cases, H, will have repeated roots and cannot be made diagonal by a change of variables. [n 
those cases, a small change in Q, or Q, will remove the problem, or else we must compute the Jordan 
form for H? {sec Strang (1976)]. 
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Figure 9.4 
Symmetric root locus of Example 9.4 
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where E is a diagonal matrix of the unstable roots (|z| > 1) and E^! is a diagonal 
matrix of the stable roots (|z| < 1). HŽ is obtained by the similarity transformation 


Ht = WOULW, 


where W is the matrix of eigenvectors of H, and can be written in block form as 


"os ex 
"s. 


where 


[— —À 
>X 
U4 


376 Chapter9 Multivariable and Optimal Control 


is the matrix of eigenvectors associated with the eigenvalues (roots) outside the 


unit circle and 
X, 
A, 


is the matrix of eigenvectors associated with the eigenvalues of € that are inside 
the unit circle. 

This same transformation matrix, W, can be used to transform x and A to the 
normal modes of the system, that is, 


be [eue 


where x* and A* are the normal modes. Conversely, we also have 


x =w lla $ I 941 
Ei X x del SSH 


The solution to the coupled set of difference equations Eq. (9.33) can be 
simply stated in terms of the initial and final conditions and the normal modes, 
because the solution for the normal modes is given by 


x* E^" 0 | [ x* | 
A = s : (9.42) 
| À i. | 0 E" À 0 


To obtain the steady state, we let N go to infinity; therefore x' (N) goes to 
zero and, in general, A*(N) would go to infinity because each element of E is 
greater than one. So we see that the only sensible solution for the steady-state 
(N — oo) case is for A*(0) = 0 and therefore A*(k) = 0 for all &.? 

From Eqs. (9.41) and (9.42) with A*(k) z 0, we have 


x(k) = X,x'(k) = X,E™x* (0), (9.43) 
A(k) = A,x'(k) = A, E*x* (0). (9.44) 

Therefore Eq. (9.43) leads to 
x'(0) = E'X; x(k). (9.45) 


Thus, from Eqs. (9.44) and (9.45) 
E 
A(k) = A,X, x(k) = S,x(&), 
which is the same form as our assumption Eq. (9.20) for the sweep method, so 
we conclude that 
2 -1 
S. = A,X; (9.46) 
5 From Eq. (9.41) we see that if A* is not zero then the state x will grow in time and the system will 
be unstable. However, if the system is controllable we know that a control exists which will make the 


system stable and give a finite value to ,7. Because we have the optimal control in Eq. (9.33), it must 
follow that the optimal system is stable and A* = 0 if Q, is such that all the states affect J. 


€ Example 9.5 
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is the steady-state solution to (9.23), and that the control law for this system 
corresponding to J with N— oo is 


u(k) = -K x(k), (9.47) 
where, from Eqs. (9.46) and (9.28), 
K, = (Q, + I'S, Dr S ®. (9.48) 


Furthermore, from Eq. (9.31), the cost associated with using this control law is 


Se Fx OS,x(). 


In summary, the complete computational procedure is: 


1. Compute eigenvalues of the system matrix H, defined by Eq. (9.34). 
2. Compute eigenvectors associated with the stable (|z| < 1) eigenvalues of H, 


and call them 
X, 
A wie 


3. Compute control gain K from Eq. (9.48) with S,, given by Eq. (9.46). 


We have already seen that the stable eigenvalues from Step 1 above are the 
resulting system closed-loop roots with constant gain K „ from Step 3. We can 
also show that the matrix X, of Eq. (9.41) is the matrix of eigenvectors of the 
optimal steady-state closed-loop system. 

Most software packages (see MATLAB's dlqr.m for the discrete case as devel- 
oped here or lqr.m for the continuous case) use algorithms for these calculations 
that are closely related to the procedure above. In some cases the software gives 
the user a choice of the particular method to be used in the solution. Although it 
is possible to find the LOR gain K, for a SISO system by picking optimal roots 
from a symmetric root locus and then using Ackermann’s formula, it is easier to 
use the general lqr routines in MATLAB for either SISO or MIMO systems. If a 
locus of the optimal roots is desired, diqr.m can be used repetitively for varying 
values of elements in Q, or Q,- 


Optimal Root Locus for the Double Mass-Spring System 


Examine the optimal locus for the double mass-spring system of Appendix A.4 and comment 
on the relative merits of this approach vs. the pole-placement approach used in Example 8.3 
for this system. 


Solution. Using the same model and sample period as in Example 8.3, we find values for 
®, F which are then used with dlqr.m to solve for the closed-loop roots for various values of 
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the weighting matrix Q,. Because the output d is the only quantity of interest, it makes sense 
to use weighting on that state element only, that is 


q, 0 0 0 
0000 
Q-|o6 oug 
0000 


and Q, is arbitrarily selected to be 1. The locus is determined by varying the value of q,,. The 
MATLAB script 


q = logspace(-3,9,300), Q2=1 
for i=1:100, 
Q1=diag([q(i);0;0;0)); 
[k,s,e]=dlqr(phi,gam,Q1,Q2), 
end 


produced a series of values of the closed-loop system roots (e above) which were plotted on 
the z-plane in Fig. 9.5. Note that the roots selected in Example 8.3 that gave the best results 
lie at the points marked by stars on the stable portion of the locus. So we see that the optimal 
solution provides guidance on where to pick the oscillatory poles. In fact, the results from this 
design led to the selection of the pole locations for the second case in Example 8.3 that yielded 
the superior response. 


Figure 9.5 
Locus of optimal roots of 
Example 9.5 
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Cost Equivalents 


It is sometimes useful to be able to find the discrete cost function defined by 
Eq. (9.11), which is the equivalent to an analog cost function of the form 


1 NT 
4.75 i (x Q,,x +0’ Quar. S 


Having this equivalence relationship will allow us to compute the J, of two 
discrete designs implemented with different sample periods and, therefore, will 
provide a fair basis for comparison. It will also provide a method for finding a 
discrete implementation of an optimal, continuous design—an "optimal" version 
of the emulation design method discussed in Section 7.2. In this section, we will 
develop the cost equivalence, and the use of it for an emulation design will be 
discussed in Section 9.3.5. 

The integration of the cost in Eq. (9.49) can be broken into sample periods 
according to 


(k+1)T 
iba 5: J (x’Q..x+u’Q_u)dr, (9.50) 
and because 
X(kT +T) 2 P(t)x(kT) + l'(c)u(kT), (9.51) 
where 
P(T) =e" r(t) = i e"dqG 
0 
as indicated by Eq. (4.58), substitution of Eq. (9.51) into (9.50) yields 
— l V T T Q, Q, x(k) 
where 
KE Q, | -f l D(t) 0 Q, 0 | his Ed a 
Qa Q ule I 0 Q, 0 I j 
(9.53) 


Equation (9.53) is a relationship for the desired equivalent, discrete weighting 
matrices; however, we see that a complication has arisen in that there are now 
cross terms that weight the product of x and u. This can be circumvented by 
transforming the control to include a linear combination of the state, as we will 
show below; however, it is also possible to formulate the LQR solution so that it 
can account for the cross terms. 
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A method for computing the equivalent gains in Eq. (9.53), due to Van Loan 
(1978), is to form the matrix exponential 


-F' 0 Q, 0 
-G7 0 0 Q, a [3 4 
exp 0 0F G E = ks ©, , (9.54) 
0 0 0 0 
It will turn out that 
r 
Pa =| 0 I | 
du = By; 


however, one needs to calculate the matrix exponential Eq. (9.54) in order to find 


P, Because 
o, = o, | Q Q, | 


21 22 
we have the desired result 


| o" a l = b (9.55) 
Routines to calculate the discrete equivalent of a continuous cost are available in 
some of the CAD control design packages (see jdequiv.m in the Digital Control 
Toolbox). Furthermore, MATLAB has an LQR routine (called Iqrd.m) that finds 
the discrete controller for a continuous cost and computes the necessary discrete 
cost in the process. 

In summary, the continuous cost function J, in Eq. (9.49) can be computed 
from discrete samples of the state and control by transforming the continuous 
weighting matrices, Q..’s, according to Eq. (9.55). The resulting discrete weight- 
ing matrices include cross terms that weight the product of x and u. The ability 
to compute the continuous cost from discrete samples of the state and control is 
useful for comparing digital controllers of a system with different sample rates 
and will also be useful in the emulation design method in the next section. 


Emulation by Equivalent Cost 


The emulation design method discussed in Section 7.2 took the approach that the 
design of the compensation be done in the continuous domain and the resulting 
D(s) then be approximated using the digital filtering ideas of Chapter 6. This same 
approach can be applied when using optimal design methods [see Parsons (1982)]. 
First, an optimal design iteration is carried out in the continuous domain until the 


6 Note that superscript ( )" denotes transpose, whereas the entire matrix is multiplied by the sample 
period, T. 


€ Example 9.6 


Figure 9.6 
Degradations of s-plane 
root location for the 
optimal emulation 


method, Example 9.6 
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desired specifications are met. The discrete approximation is then obtained by 
calculating the discrete equivalent of the continuous cost function via Eq. (9.55), 
and then using that cost in the discrete LQR computation of Table 9.1, with 
modifications to account for the cross terms in the weighting matrices. These 
steps are all accomplished by MATLAB’s lqrd.m. 


Design by Equivalent Cost 


Examine the accuracy of the equivalent cost emulation method for the satellite attitude control 
example. 


Solution. The continuous representation of the system is specified by F and G from 
Eq. (4.47). Use of a continuous LQR calculation (Iqr.m in MATLAB) with 


1 0 
a= | al 


Q.» = [10000, 1, 0.5, 0.2, 0.1, 0.05, 0.02, 0.01] 


and 


results in s-plane roots exactly at ¢ = 0.7 as shown by the line in Fig. 9.6. Use of the 
Iqrd.m function computes the discrete controller that minimizes the same continuous cost, 
thus arriving at an emulation of the continuous design. The equivalent s-plane roots of these 
digital controllers are also plotted in Fig. 9.6. The figure shows that very little change in root 
locations occur by this emulation method. In fact the change in root locations is about 1% when 


##« Continuous design 
000 Emulation design 


Imag s 
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9.4 


sampling at six times the closed-loop natural frequency (w,/w, = 6) and increases to about 
10% for w/w, = 3. The accuracy of the zero-pole mapping emulation method was evaluated 
in Example 7.3; its calculations for a similar example showed that w/w, = 30 resulted in a 
10% change in root location, and w,/w, = 6 resulted in a 60% reduction in damping. 


* 


In general, use of the optimal emulation method will result in a digital 
controller whose performance will match the continuous design much closer than 
any of the emulation design methods discussed in Section 7.2 and Chapter 6. A 
requirement to use the method, however, is that the original continuous design be 
done using optimal methods so that the continuous weighting matrices Q, and 
Q, are available for the conversion because they are the parameters that define 
the design. 

As discussed in Chapter 7, emulation design is attractive because it allows for 
the design process to be carried out before specifying the sample rate. Sampling 
degrades the performance of any system to varying degrees, and it is satisfying 
to be able to answer how good the control system can be in terms of disturbance 
rejection, steady-state errors, and so on, before the sampling degradation is in- 
troduced. Once the characteristics of a reasonable continuous design are known, 
the designer is better equipped to select a sample rate with full knowledge of how 
that selection will affect performance. 

We acknowledge that the scenario just presented is not the reality of the 
typical digital design process. Usually, due to the pressure of schedules, the 
computer and sample rate are specified long before the controls engineers have 
a firm grasp on the controller algorithm. Given that reality, the most expedient 
path is to perform the design directly in the discrete domain and obtain the best 
possible with that constraint. Furthermore, many design exercises are relatively 
minor modifications to previous designs. In these cases, too, the most expedient 
path is to work directly in the discrete domain. 

But we maintain that the most desirable design scenario is to gain knowledge 
of the effects of sampling by first performing the design in the continuous domain, 
then performing discrete designs. In this case, the emulation method described 
here is a useful tool to obtain quickly a controller to be implemented digitally or 
to use as a basis for further refinement in the discrete domain. 


Optimal Estimation 


Optimal estimation methods are attractive because they handle multi-output sys- 
tems easily and allow a designer quickly to determine many good candidate 
designs of the estimator gain matrix, L. We will first develop the least squares 
estimation solution for the static case as it is the basis for optimal estimation, then 


9.4.1 


least squares estimate 
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we will extend that to the time-varying optimal estimation solution (commonly 
known as the “Kalman filter”), and finally show the correspondence between the 
Kalman filter and the time-varying optimal control solution. Following the same 
route that we did for the optimal control solution, we will then develop the op- 
timal estimation solution with a steady-state L-matrix. In the end, this amounts 
to another method of computing the L-matrix in the equations for the current 
estimator, Eqs. (8.33) and (8.34), which are 


K(k) = x(k) + L(y(k) — Y(&)), (9.56) 
where 
X(k + 1) = ®x(k) + Fuk); 


however, now L will be based on minimizing estimation errors rather than picking 
dynamic characteristics of the estimator error equation. 


Least-Squares Estimation 
Suppose we have a linear static process given by 
y= Hx +v, (9.57) 


where y is a p x ] measurement vector, xis ann x l unknown vector, visa p x | 
measurement error vector, and H is the matrix relating the measurements to the 
unknowns. We want to determine the best estimate of x given the measurements 
y. Often, the system is overdetermined; that is, there are more measurements in 
y than the unknown vector, x. A good way to find the best estimate of x is to 
minimize the sum of the squares of v, the fiterror. This is called the least squares 
solution. This is both sensible and very convenient analytically. Proceeding, the 
sum of squares can be written as 


rm sv M 50 — Hx)” (y — Hx) (9.58) 


and, in order to minimize this expression, we take the derivative with respect to 
the unknown, that is 


a = (y — Hx)'(-H) = 0 (9.59) 
which results in 
H'y = H'Hx, 
so that 
È = [H H] H" y (9.60) 


where X designates the “best estimate" of x. Note that the matrix to be inverted is 
n x n and that p must be > n for it to be full rank and the inverse to exist. If there 
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are fewer measurements than unknowns (p < 7), there are too few measurements 
to determine a unique value of x. 
The difference between the estimate and the actual value of x is 
&—x-[H'H] 'H' (Hx + v) — x 
= [H"H] 'H"v. (9.61) 
Equation (9.61) shows that, if v has zero mean, the error in the estimate, X — x, 
will also be zero mean, sometimes referred to as an unbiased estimate. 
The covariance of the estimate error, P, is defined to be 
P = Ef — x)(K—x)"} 
= £(H'H)'H'w'H(H'H) ^!) 
= (H'H)^H'£(w']H(H'H)"!. (9.62) 
If the elements in the noise vector, v, are uncorrelated with one another, £ (vv"]? 


is a diagonal matrix, which we shall call R. Furthermore, if all the elements of v 
have the same uncertainty, then all the diagonal elements of R are identical, and 


Elvy} = R = Ic?, (9.63) 


where ø is the rms value of each element in v. In this case, Eq. (9.62) can be 
written as 


P = (H'H)^!c? (9.64) 


and is a measure of how well we can estimate the unknown x. The square root of 
the diagonal elements of P represent the rms values of the errors in each element 
in x. 


€ Example 9.7 


Least-Squares Fit 
The monthly sales (in thousands $) for the first year of the Mewisham Co. are given by 
y7 2[020.5 1.112 1.1 1.3 1.1 1.2 2.0 1.222 4.0]. 


Findthe least-squares fit parabola to this data and use that to predict what the monthly sales will 
be during the second year. Also state what the predicted accuracy of the parabolic coefficients 
are, assuming that the rms accuracy of the data is $700. 


Solution. The solution is obtained using Eq. (9.60), where H contains the parabolic function 
to be fit. Each month's sales obeys 


2 
y; =a, +a,t, aftu, 


7 E is called the expectation and effectively means the average of the quantity in(  ]. 
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so that, in vector form 


2 
y 1 f f v 
Y lh a, v 
y [9 1 A ü a |+ v [| 
" a, . 
and we see that 
2 
L4 i 
Bl Fee | P AI? 
d s [sm a RB 
and 
a 
o 
x=| a, 
a 


Evaluation of Eq. (9.60) produces an estimate of x 


0.7432 
2 = | —0.0943 
0.0239 


which is used to plot the “best fit" parabola along with the raw data in Fig. 9.7. The data used 
to determine the parabola only occurred during the first 12 months, after that the parabola is 
an extrapolation. 


Figure 9.7 
Least-squares fit of 
parabola to data in 
Example 9.7 


y = 0.7432 — 0.0943: +:0.02397, ^^ 


z 


y, Sales (1000 $) 


t (Months) 
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Equation (9.64), with H as above and c? = 0.49 (o was given to be 0.7), shows that 


0.5234 —0.1670 0.0111 
P= | -0.1670 0.0655 —0.0048 |, 
0.0111 —0.0048 0.0004 


which means that the rms accuracy of the coefficients are 


o, = V0.5234 = 0.7235 


o 


c. = V0.0655 = 0.2559 


"n 


c. = 40.0004 = 0.0192. 


92 


Weighted Least Squares 


In many cases, we know a priori that some measurements are more accurate than 
others so that all the diagonal elements in R are not the same. In this case, it 
makes sense to weight the measurement errors higher for those measurements 
known to be more accurate because that will cause those measurements to have 
a bigger influence on the cost minimization. In other words, the cost function in 
Eq. (9.58) needs to be modified to 


kic avy, (9.65) 


where W is a diagonal weighting matrix whose elements are in some way 
inversely related to the uncertainty of the corresponding element of v. Performing 
the same algebra as for the unweighted case above, we find that the best weighted 
least squares solution is given by 


& = [H’ WH] H" Wy. (9.66) 


The covariance of this estimate also directly follows the development of Eq. (9.62) 
and results in 


P = (H’ WH)"'H’ WE {vv }WH(H’ WH). (9.67) 


A logical choice for W is to let it be inversely proportional to R, thus weighting 
the square of the measurement errors exactly in proportion to the inverse of their 
a priori mean square error, that is, let 


W-RK' (9.68) 


This choice of weighting matrix is proven in Section 12.7 to minimize the trace 
of P and is called the best linear unbiased estimate. With this choice, Eq. (9.66) 
becomes 


x= [H'RH|'H?R 'y, (9.69) 


9.4 Optimal Estimation 387 


and Eq. (9.67) reduces to 
P = (H'R^'H)'. (9.70) 


Recursive Least Squares 


The two least-squares algorithms above, Eqs. (9.60) and (9.69), are both batch 
algorithms in that all the data is obtained and then processed in one calculation. 
For an estimation problem that runs for a long time, the measurement vector 
would become very large, and one would have to wait until the problem was 
complete in order to calculate the estimate. The recursive formulation solves 
both these difficulties by performing the calculation in small time steps. The 
ideas are precisely the same, that is, a weighted least squares calculation is being 
performed. But now we break the problem into old data, for which we already 
found X, and new data, for which we want a correction to X, so that the overall 
new X, is adjusted for the newly acquired data. 
The problem is stated as 


FIBEACUMI (9.71) 


where the subscript o represents old data and n represents new data. The best 
estimate of x given all the data follows directly from Eq. (9.69) and can be written 
as 


T F 
H, R; 0 H, $ H, R;' 0 y, 

| H, | l 0 R;! | | H, IE mid | H | | 0 R | | y, , (9.72) 

where X is the best estimate of x given all the data, old and new. Let's define X as 


& =X, + 6X, (9.73) 


where x, is the best estimate given only the old data, y,. We want to find an 
expression for the correction to this estimate, 6X, given the new data. Since X, 
was the best estimate given the old data, it satisfies 


[H]R;'H,)&, = HjR; y,. (9.74) 
Expanding out the terms in Eq. (9.72) and using Eqs. (9.73) and (9.74) yields 
H/R;'H,&, + [HIR] H, + H' R;H,]jX = HR; y,, (9.75) 


which can be solved for the desired result 
ôk = [H7 R;'H, + H/R.H,]'H/R; (y, - H,x,). (9.76) 


Equation (9.70) defined the covariance of the estimate, and in terms of the old 
data would be written as 


P, — (H/R;'H,), (9.77) 
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procedure 


so that Eq. (9.76) reduces to 
ôk = [P;! + HT R'H,]' HT R; (y, — H,&,). (9.78) 


and we see that the correction to the old estimate can be determined if we 
simply know the old estimate and its covariance. Note that the correction to x is 
proportional to the difference between the new data y, and the estimate of the 
new data H, €, based on the old x. 

By analogy with the weighted least squares, the covariance of the new esti- 
mate is 


P, -[P; +H R HT ' (9.79) 


Note here that it is no longer necessary for there to be more new measurements 
than the elements in x . The only requirement is that P, be full rank, which could 
be satisfied by virtue of P, being full rank. In other words, in Example 9.7 it 
would have been possible to start the process with the first three months of sales, 
then recursively update the parabolic coefficients using one month's additional 
sales at a time. In fact, we will see in the Kalman filter that it is typical for the 
new y to have fewer elements than x. 

To summarize the procedure, we start by assuming that x, and P, are available 
from previous calculations. 


e Compute the new covariance from Eq. (9.79) 
Ep HHROH) 
e Compute the new value of X using Eqs. (9.73), (9.78), and (9.79) 
££ +P HIR (y, — H,x,). (9.80) 


n o 


e Take new data and repeat the process. 


This algorithm assigns relative weighting to the old X vs. new data based on their 
relative accuracies, similarly to the weighted least squares. For example, if the 
old data produced an extremely accurate estimate so that P, was almost zero, 
then Eq. (9.79) shows that P, is = 0 and Eq. (9.80) shows that the new estimate 
will essentially ignore the new data. On the other hand, if the old estimates are 
very poor, that is, E is very large, Eq. (9.79) shows that 


P, = [H; R;'H,]' ^ 
and Eq. (9.80) shows that 
& = [H; R; BH Rey 


n! 


which ignores the old data. Most cases are somewhere between these two ex- 
tremes, but the fact remains that this recursive-weighted-least-squares algorithm 
weights the old and new data according to the associated covariance. 


9.4.2 
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The Kalman Filter 
Now consider a discrete dynamic plant 
x(k + 1) = Ox(k) + Puck) + P, w(k) (9.81) 
with measurements 
y(k) = Hx(k) + v(k), (9.82) 


where the process noise w(k) and measurement noise v(k) are random se- 
quences with zero mean, that is 


E{w(k)) = E{v(k)} — 0, 
have no time correlation or are “white” noise, that is 
E{w(i)w’ (j)} 26twv'g)20  ifizj, 
and have covariances or mean square “noise levels” defined by 
£(w(k)w' (k)) = R,, E{v(k)v’ (k)} =R.. 


We allow L in Eq. (9.56) to vary with the time step, and we wish to pick L(k) so 
that the estimate of x(k), given all the data up to and including time k, is optimal. 

Let us pretend temporarily that without using the current measurement y (X), 
we already have a prior estimate of the state at the time of a measurement, which 
we will call X(k). The problem at this point is to update this old estimate based 
on the current new measurement. 

Comparing this problem to the recursive least squares, we see that the esti- 
mation measurement equation Eq. (9.82) relates the new measurements to x just 
as the lower row in Eq. (9.71) does; hence the optimal state estimation solution 
is given by Eq. (9.80), where x, takes the role of x(k), P, = P(k), H, = H, and 
R, — R,. The solution equations are 


X(k) = x(k) + L(&)(y(k) — Hx(K)), (9.83) 
where, 
L(k) = P(K)H'R;'. (9.84) 


Equation (9.79) is used to find P(k), where we now call the old covariance M(k) 
instead of R thus 


P(k) = [M^ + H'R;'H]''. (9.85) 


The size of the matrix to be inverted in Eq. (9.85) is n x n, where n is the 
dimension of x . For the Kalman filter, y usually has fewer elements than x , and 
itis more efficient to use the matrix inversion lemma (See Eq. (C.6) in Appendix 
C) to convert Eq. (9.85) to 


P(k) = M(k) - M()H' (HM(OH' + R,)~'HM(k). (9.86) 
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M(k) is the covariance (or expected mean square error) of the state estimate, 
x(k), before the measurement. The state estimate after the measurement, x(k), 
has an error covariance P(k). 

The idea of combining the previous estimate with the current measurement 
based on the relative accuracy of the two quantities—the recursive least-squares 
concept—was the genesis for the relationships in Eq. (9.83) and Eq. (9.85) and 
is one of the basic ideas of the Kalman filter. The other key idea has to do with 
using the known dynamics of x to predict its behavior between samples, an idea 
that was discussed in the development of the estimator in Chapter 8. 

Use of dynamics in the propagation of the estimate of x between k — 1 and k 
did not come up in the static least-squares estimation, but here this issue needs to 
be addressed. The state estimate at k — 1, given data up through k — 1, is called 
x(k — 1), whereas we defined x(k) to be the estimate at k given the same data up 
through k — 1. In the static least squares, these two quantities were identical and 
both called x, because it was presumed to be a constant; but here the estimates 
differ due to the fact that the state will change according to the system dynamics 
as time passes. Specifically, the estimate x(k) is found from x(k — 1) by using 
Eq. (9.81) with w(k — 1) — 0, because we know that this is the expected value 
of x(k) since the expected value of the plant noise, €{w(k — 1)) is zero. Thus 


x(k) = X(k — 1) + Pu(k — 1). (9.87) 


The change in estimate from X(k — 1) to x(k) is called a “time update,” whereas 
the change in the estimate from x(k) to (k) as given by Eq. (9.83) is a “measure- 
ment update,” which occurs at the fixed time k but expresses the improvement 
in the estimate due to the measurement y(k). The same kind of time and mea- 
surement updates apply to the estimate covariances, P and M: P represents the 
estimate accuracy immediately after a measurement, whereas M is the propagated 
value of P and is valid just before measurements. From Eq. (9.81) and Eq. (9.87) 
we see that 


x(k 4-1) — x(k +1) = (x(k) — x(k)) + T w(k), (9.88) 


which we will use to find the covariance of the state at time k + 1 before taking 
y(k + 1) into account 


M(k + 1) 2 €[(x(k +1) — x(k + D)(x(k + 1) — x(k + )7]. 


If the measurement noise, v, and the process noise, w, are uncorrelated so that 
x(k) and w(k) are also uncorrelated, the cross product terms vanish and we find 
that 


M(k + 1) = ELB(x(k) — &())(x() — &(0) + P, wüow' ()TT1. (9.89) 
But because 


P(k) = E{(x(k) — &(0)(x(k) — Ê(k))"} and R, = £(w(Ow" (4), 


Kalman filter equations 
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Eq. (9.89) reduces to 
M(k + 1) = 9P()9" +r RTT. (9.90) 


This completes the required relations for the optimal, time-varying gain, state 
estimation, commonly referred to as the Kalman filter. A summary of the required 
relations is: 


e At the measurement time (measurement update) 


K(k) = K(k) + POOH"R; '(y(k) — Hx(k)), (9.91) 
where 
P(k) = M(k) — M(OH'(HM(OH' + R) HM (k). (9.92) 
e Between measurements (time update) 
x(k + 1) = Bx(k) + Puck) (9.93) 
and 
M(k +1) = ®P(k)®’ +T RIT, (9.94) 


where the initial conditions for X(0) and M(0) = €{x(0)x"(0)} must be 
assumed to be some value for initialization. 


Because M is time-varying, so will be the estimator gain, L, given by 
Eq. (9.84). Furthermore, we see that the structure of the estimation process 
is exactly the same as the current estimator given by Eq. (9.56), the difference 
being that L is time varying and determined so as to provide the minimum esti- 
mation errors, given a priori knowledge of the process noise magnitude, R, the 
measurement noise magnitude, R,, and the covariance initial condition, M(0). 


€ Example 9.8 


Time-Varying Kalman Filter Gains 


Solve for L(&) for the satellite attitude control problem in Example 9.3 assuming the angle, 0, 
is sensed with a measurement noise covariance 


2 
R, — 0.1 deg’. 


Assume the process noise is due to disturbance torques acting on the spacecraft with the 
disturbance-input distribution matrix 
0 
asl? 


and assume several values for the mean square magnitude of this disturbance: 


R,, = 0.001, 0.01, 0.1 (deg?/sec^). 
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Figure 9.8 
Example of estimator 
gains versus time 


Solution. Because only @ is directly sensed, we have from Eq. (4.47) 
H= E QJ 


The time varying estimator gain, L(k), is found by evaluating Eqs. (9.84), (9.92), and (9.94). In 
order to start these recursive equations, some value for M(0) is required. Although somewhat 


arbitrary, a value of 
120 
M(0) = | 0 1 | 


was selected in order to indicate that the initial rms uncertainty in 0 was 1° and 6 was 1°/sec. 
The gain time histories are sensitive to this initial condition estimate during the initial transient, 
but the steady final values are not affected. 

Figure 9.8 shows the time history of the L’s for the values given above. We see from the 
figure that after an initial settling time, the estimator gains essentially reach a steady state. Just 
as for the control problem, where the feedback gain reached a steady value early in the problem 
time, the eventual steady value for L occurs for all linear constant coefficient systems. The 
subject of the next section is a method to compute the value of this steady-state gain matrix so 
that it can be used in place of the time-varying one, thus eliminating the need to go through 
the rather lengthy recursive computation of Eq. (9.92) and Eq. (9.94), 


+ 


Given an actual design problem, one can often assign a meaningful value to 
R,, which is based on the sensor accuracy. The same cannot be said for R,. The 


divergent filters 
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assumption of white process noise is often a mathematical artifice that is used 
because of the ease of solving the resulting optimization problem. Physically, R, 
is crudely accounting for unknown disturbances, whether they be steps, white 
noise, or somewhere in between, and for imperfections in the plant model. 

If there is a random disturbance that is time correlated—that is, colored 
noise—it can be accurately modeled by augmenting ® with a coloring filter that 
converts a white-noise input into time-correlated noise, thus Eq. (9.81) can also 
be made to describe nonwhite disturbances. In practice, however, this is often not 
done due to the complexity. Instead, the disturbances are assumed white, and the 
noise intensity is adjusted to give acceptable results in the presence of expected 
disturbances, whether time-correlated or not. 

If R, was chosen to be zero due to a lack of knowledge of a precise noise 
model, the estimator gain would eventually go to zero. This is so because the 
optimal thing to do in the idealistic situation of no disturbances and a perfect plant 
model is to estimate open loop after the initial condition errors have completely 
died out; after all, the very best filter for the noisy measurements is to totally ignore 
them. In practice, this will not work because there are always some disturbances 
and the plant model is never perfect; thus, the filter with zero gain will drift away 
from reality and is referred to as a divergent filter. If an estimator mode with 
zero gain was also naturally unstable, the estimator error would diverge from 
reality very quickly and likely result in saturation of the computer. We therefore 
are often forced to pick values of R, and sometimes D, “out of a hat" in the 
design process in order to assure that no modes of the estimator are without 
feedback and that the estimator will track all modes of the actual system. The 
disturbance noise model should be selected to approximate that of the actual 
known disturbances when practical, but the designer often settles on acceptable 
values based on the quality of the estimation that results in subsequent simulations 
including all known disturbances, white and otherwise. 

It is possible to include a nonzero process noise input in the plant model 
and yet still end up with a divergent filter [Bryson (1978)]. This can arise when 
the process noise is modeled so that it does not affect some of the modes of the 
system. Bryson showed that the filter will not be divergent if you select I’, so that 
the system, (®, I”), is controllable and all diagonal elements of R, are nonzero. 

For an implementation of a time-varying filter, initial conditions for M and x 
are also required. Physically, they represent the a priori estimate of the accuracy 
of x(0), which in turn represents the a priori estimate of the state. In some cases, 
there might be test data of some kind to support their intelligent choice; however, 
that is not typical. In lieu of any better information, one could logically assume 
that the components of x(0) contained in y are equal to the first measurement, and 
the remaining components are equal to zero. Similarly, the components in M that 


8 A filter with one or more modes that is running open loop is sometimes dubbed “oblivious” or “fat, 
dumb, and happy" because it ignores the measurements. It is somewhat analogous to the person whose 
mind is made up and not interested in facts. 
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represent a measured component could be logically set to R, and the remaining 
components set to a high value. 

In obtaining the values of L in Fig. 9.8, it was necessary to solve for the 
time history of P. Because P is the covariance of the estimation errors, we can 
sometimes use P as an indicator of estimation accuracy, provided that the values 
of R, and R, are based on some knowledge of the actual noise characteristics 
and that w and v are approximately white. 

The intent of this section and example is to give some insight into the nature 
of the solution so as to motivate and provide a basis for the following section. 
Readers interested in the application of Kalman filters are encouraged to review 
works devoted to that subject, such as Bryson and Ho (1975), Anderson and 
Moore (1979), and Stengel (1986). 


Steady-State Optimal Estimation 


As shown by Example 9.8, the estimator gains will eventually reach a steady- 
state value if enough time passes. This is so because the values of M and P 
reach a steady value. Because of the substantial simplification in the controller 
afforded by a constant estimator gain matrix, it is often desirable to determine 
the constant gain during the design process and to implement that constant value 
in the controller. As discussed in Section 9.2, many control systems run for very 
long times and can be treated mathematically as if they run for an infinite time. 
In this case the constant gain is the optimal because the early transient period has 
no significant effect. Whatever the motivation, a constant-gain Kalman filter is 
identical in structure to the estimator discussed in Chapter 8, the only difference 
being that the gain, L, is determined so that the estimate errors are minimized for 
the assumed level of process and measurement noise. This approach replaces the 
pole-placement method of finding the estimator gain and has the highly desirable 
feature that it can be applied to MIMO systems. 

The equations to be solved that determine M and P are Eq. (9.92) and 
Eq. (9.94). Repeated, they are 


P(k) = M(k) — M(k)H7(HM(k)H? + R,)'HM(k), (9.92) 
M(k + 1) = ®P(K)®? +r RT (9.94) 


Comparing Eqs. (9.92) and (9.94) to the optimal control recursion relationships, 
Eq. (9.24) and Eq. (9.25) 


M(k) = S(k) - SOT IQ, + T"SQOT] T" S), (9.25) 
S(k) = &'M(k + 6 4- Q,, (9.24) 


we see that they are precisely of the same form! The only exception is that 
Eq. (9.94) goes forward instead of backward as Eq. (9.24) does. Therefore, we 
can simply change variables and directly use the steady-state solution of the 


LQG 


Table 9.1 
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control problem as the desired steady-state solution to the estimation problem, 
even though the equations are solved in opposite directions in time. 

Table 9.1 lists the correspondences that result by direct comparison of the con- 
trol and estimation recursion relations: Eq. (9.25) with Eq. (9.92) and Eq. (9.24) 
with Eq. (9.94). 

By analogy with the control problem, Eqs. (9.92) and (9.94) must have 
arisen from two coupled equations with the same form as Eq. (9.33). Using the 
correspondences in Table 9.1, the control Hamiltonian in Eq. (9.34) becomes the 
estimation Hamiltonian 
E $'-H'R,;Ho9 T R,T] -H'R;Ho" 

m -$ 'nR,T o' 


Therefore, the steady-state value of M is deduced by comparison with 
Eq. (9.46) and is 


(9.95) 


e 


M, = A,X;', (9.96) 


A, 
are the eigenvectors of H, associated with its stable eigenvalues. Hence, from 


Eqs. (9.92) and (9.84) after some manipulation we find the steady state Kalman- 
filter gain to be? 


where 


L,, = M,H'(HM,.H' +R)”. (9.97) 


This is a standard calculation in MATLAB's kalman.m. Sometimes this solution is 
referred to as the linear quadratic Gaussian (LQG) problem because it is often 
assumed in the derivation that the noise has a Gaussian distribution. As can be 
seen from the development here, this assumption is not necessary. However, with 
this assumption, one can show that the estimate is not only the one that minimizes 


Control and Estimation Duality 


Control Estimation 


o [4 
M P 

S M 
Q, PRY 
r H7 
Q: R, 


9 In the steady state, the filter has constant coefficients and, for the assumed model, is the same as the 
Wiener filter. 
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the error squared, but also the one that is the statistically “most likely.” In this 
derivation, the result is referred to as the maximum likelihood estimate. 

Because Eq. (9.34) and Eq. (9.95) are the same form, the eigenvalues have 
the same reciprocal properties in both cases. Furthermore, for systems with a 
single output and a single process noise input, the symmetric root locus follows 
by analogy with Eq. (9.40) and the use of Table 9.1. Specifically, the characteristic 
equation becomes 


14+ 9G6,(z-')G,(z) =0, (9.98) 
where q = R,/ R, and 
G,(z) = HI- 9) T ,. 


Therefore, for systems where the process noise is additive with the control input— 
that is, L = T , —the control and estimation optimal root loci are identical, and the 
control and estimation roots could be selected from the same loci. For example, 
the loci in Figs. 9.4 and 9.5 could be used to select estimator roots as well as the 
control roots. 


Noise Matrices and Discrete Equivalents 


The quantities defining the noise magnitude, the covariance matrices R, and R,, 
were defined in Section 9.4.2 as 


R, —£(w()w'(k)) and R, = E{v(k)v" (&)). 


Typically, if there is more than one process or measurement noise component, one 
has no information on the cross-correlation of the noise elements and therefore 
R,, and R, are selected as diagonal matrices. The magnitudes of the diagonal 
elements are the variances of the noise components. 


Process Noise, R, 


The process noise acts on the continuous portion of the system and, assuming 
that it is a white continuous process, varies widely throughout one sample period. 
Its effect over the sample period, therefore, cannot be determined as it was for 
Eq. (9.90); instead, it needs to be integrated. From Eqs. (4.55) and (4.56), we see 
that the effect of continuous noise input over one sample period is 


T 
x(k + 1) = ®x(k) + / e"?G w(n)dn, 
0 
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where G, is defined by Eq. (4.45). Repeating the derivation’? of Eq. (9.90) with 
the integral above replacing r w(k) in Eq. (9.88), we find that Eq. (9.90) becomes 


y T 
M(k + 1) = BP(k)@” «f f $(7)G,£[w(2)w! (t)]G; ' (t)dtdn. 
0 0 


(9.99) 
But the white noise model for w means that 


E[w(n)w7 (7)] = Ry 3 — 7), 


where R wps 4 İS called the power spectral density, or mean-square spectral density, 
of the continuous white noise. Therefore, Eq. (9.99) reduces to 


M(k + 1) = PP (k) +C,, 


where 
T 
C = f P(1)G R Gi 9" (Cdr. (9.100) 
0 


Calculation of this integral (see disrw.m in the Digital Control Toolbox) can 
be carried out using a similar exponential form due to Van Loan (1978), as in 
Eq. (9.54). If T is very short compared to the system time constants, that is 


$I and T, zn, 
then the integral is approximately 


C, S TG RoG, 


wpsd 
which can also be written'! 
R 
WS wpsd 447 
GIL E hs 1 
Therefore, one can apply the discrete covariance update Eq. (9.94) to the case 
where w is continuous noise by using the approximation that 


r= == (9.101) 


w 


as long as T is much shorter than the system time constants. If this assumption 
is not valid, one must revert to the integral in Eq. (9.100). 

In reality, however, there is no such thing as white continuous noise. A pure 
white-noise disturbance would have equal magnitude content at all frequencies 
from 0 to oo. Translated, that means that the correlation time of the random 
signal is precisely zero. The only requirement for our use of the white-noise 


10 See Stengel (1986), p. 327. 


11 By computing I’, exactly according to Eq. (4.58), the approximation that follows for C, is substantially 
more accurate than the approximation using G,. 
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model in discrete systems is that the disturbance have a correlation time that is 
short compared to the sample period. If the correlation time is on the same order 
or longer than the sample period, the correct methodology entails adding the 
colored-noise model to the plant model and estimating the random disturbance 
along with the original state. In fact, if the correlation time is extremely long, the 
disturbance acts much like a bias, and we have already discussed its estimation in 
Section 8.5.2. In practice, disturbances are often assumed to be either white or a 
bias, because the improved accuracy possible by modeling a random disturbance 
with a time constant on the same order as the sample period is not deemed worth 
the extra complexity. 

The determination of the appropriate value of R, s that represents a physical 
process is aided by the realization that pure white noise does not exist in nature. 
Disturbances all have a nonzero correlation time, the only question being: How 
short? Assuming that the time correlation is exponential with a correlation time 
t,, and that t, < system time constants, the relation between the power spectral 
density and the mean square of the signal is" 


Rua = 27, Elw (t)). (9.102) 


wpsd ^— 


Typically, one can measure the mean square value, £(w?(t)), and can either 
measure or estimate its correlation time, t,, thus allowing the computation of 
each diagonal element of Rosa" However, the desired result for our purposes 
is the discrete equivalent noise, R,. It can be computed from Eq. (9.101) and 
Eq. (9.102), where each diagonal element, {R ];, is related to the mean square 
and correlation time of the ith disturbance according to 


2 2 
[R,], = Flr, E(w], (9.103) 


Note from Eq. (9.103) that R, and € (w^ (r)) are not the same quantities. Specifi- 
cally, the diagonal elements of R, are the mean square values of the discrete noise, 
w(k), that produces a response from the discrete model given by Eq. (9.90) that 
matches the response of the continuous system acted on by a w(t) with E (w^(r)) 
and t,. Note also that it has been assumed that the noise is white compared to 
the sample period, that is, t, « T and that T < any system time constants. 
Under these conditions the discrete equivalent mean square value is less than the 
continuous signal mean square because the continuous random signal acting on 
the continuous system is averaged over the sample period. If t, is not < T, then 
the noise is not white and Eq. (9.100) is not valid; thus calculation of R, is not 
relevant. 


12 Bryson and Ho (1975), p. 331. 
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Sensor Noise, R, 


The pertinent information given by the manufacturer of a sensor product would 
be the rms “jitter” error level (or some other similar name), which can usually be 
interpreted as the random component and assumed to be white, that is, uncorre- 
lated from one sample to the next. The rms value is simply squared to arrive at 
the diagonal elements of £ (v^(t)). Unlike the process noise case these values are 
used directly, that is 


[R,], = CONE (9.104) 


The assumption of no time correlation is consistent with the development of the 
optimal estimator that was discussed in Section 9.4.2. If the correlation time of 
a sensor is longer than the sample period, the assumption is not correct, and an 
accurate treatment of the noise requires that its “coloring? model be included 
with the plant model and the measurement noise error estimated along with the 
rest of the state.'? One could also ignore this complication and proceed as if the 
noise were white, with the knowledge that the effect of the measurement noise 
is in error and that skepticism is required in the interpretation of estimate error 
predictions based on P. Furthermore, one could no longer claim that the filter 
was optimal. 

Sensor manufacturers typically list bias errors also. This component should 
at least be evaluated to determine the sensitivity of the system to the bias, and if 
the effect is not negligible, the bias should be modeled, augmented to the state, 
and estimated using the ideas of Section 8.5.2. 

Note that neither the sample period nor the correlation time of the sensor 
error has an impact on R, if v is white. Although the rms value of v is not affected 
by the sampling, sampling at a higher rate will cause more measurements to be 
averaged in arriving at the state estimate, and the estimator accuracy will improve 
with sample rate. 

In some cases, the designer wishes to know how the sensor noise will affect 
an estimator that is implemented with analog electronics. Although this can 
be done in principle, in practice it is rarely done because of the low cost of 
digital implementations. The value of analyzing the continuous case is that the 
knowledge can be useful in selecting a sample rate. Furthermore, the designer is 
sometimes interested in creating a digital implementation whose roots match that 
of a continuous design, and finding the discrete equivalent noise is a method to 
approximate that design goal. Whatever the reason, the continuous filter can be 
evaluated digitally, with the appropriate value of R, being the one that provides 
the discrete equivalent to the continuous process, the same situation that was 


13 See Stengel (1986) or Bryson and Ho (1975). 
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examined for the process noise. Therefore, the proper relation in this special case 
is from Eq. (9.101) or (9.103) 


R, sd 2 

R, -— or [R]; = Žir EO), (9.105) 
T Li T c t 

where £ (v? (1)) is the mean square value of the sensor noise and t, is itscorrelation 

time. Alternatively, if one desires only the continuous filter performance for a 

baseline, one can use a pure continuous analysis of the filter," which requires 

only Ry a 


Multivariable Control Design 


The elements of the design process of a MIMO system have been discussed in 
the preceding sections. This section discusses some of the issues in design and 
provides two examples of the process. 


Selection of Weighting Matrices Q, and Q, 


As can be seen from the discussion in Sections 9.2 and 9.3, the selection of Q, 
and Q, is only weakly connected to the performance specifications, and a certain 
amount of trial and error is usually required with an interactive computer program 
before a satisfactory design results. There are, however, a few guidelines that can 
be employed. For example, Bryson and Ho (1975) and Kwakernaak and Sivan 
(1972) suggest essentially the same approach. This is to take Q, = H'Q,H so 
that the states enter the cost via the important outputs (which may lead to H = I 
if all states are to be kept under close regulation) and to select Q, and Q, to be 
diagonal with entries so selected that a fixed percentage change of each variable 
makes an equal contribution to the cost. For example, suppose we have three 
outputs with maximum deviations m,, m,, and m,. The cost for diagonal Q, is 


gm + G3 D qs 

The rule is that if y, = om,, y, = &m,, and y, = am,, then 
qui = 15) = 1X3 

thus 


- om m gh 23 
q,,0 m, = J, m; = q40 M}. 


14 See Bryson and Ho (1975). 


15 Kwakernaak and Sivan (1972) suggest using a percentage change from nominal valucs: Bryson and 
Ho use percentage change from the maximum values. 
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A Satisfactory solution for elements of Q, is then '° 


OQ, = 1yfpi sca. Oi = ne (9.106) 
Similarly, for Q, we select a matrix with diagonal elements 
Ooty = Wimax» Q222 = Meu (9.107) 


There remains a scalar ratio between the state and the control terms, which we 
will call p. Thus the total cost is 


J = px' H'Q,Hx + u’ Q,u, (9.108) 


where Q, and Q, are given by Eq. (9.106), and Eq. (9.107), and p is to be selected 
by trial and error. A computer-interactive procedure that allows examination of 
root locations and transient response for selected values of Q, and Q, expedites 
this process considerably. 


Pincer Procedure 


The designer can introduce another degree of freedom into this problem by 
requiring that all the closed-loop poles be inside a circle of radius 1/o, where 
a > 1. If we do this, then the magnitude of every transient in the closed loop 
will decay at least as fast as | /a*, which forms pincers around the transients and 
allows a degree of direct control over the settling time. We can introduce this 
effect in the following way. 

Suppose that as a modification to the performance criterion of Eq. (9.11), we 
consider 


J, = Ix Qx +u" Q,ujo, (9.109) 
k=0 
We can distribute the scalar term œ% in Eq. (9.109) as o*o* and write it as 


J, = 1o)" Q, (o^x) + (a^u)' Q, (a^u)] 


=0 
[27Q,z+v’Q,v], (9.110) 


k 
k 


© 


where 
z(k) =a*x(k),  v(k) = œ*u(k). (9.111) 
The equations in z and v are readily found. Consider 


2(k +1) 2 o**!x(k + 1). 


16 Sometimes called Bryson’s rules. 
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From Eq. (9.10) we have the state equations for x(k + 1), so that 
z(k + 1) = o**![dx(k) + F'u(X)]. 


kl 


If we multiply through by the o^ -term, we can write this as 


z(k + 1) = a(o*x(k)) + aE(o*u(k)), 
but from the definitions in Eq. (9.111), this is the same as 
z(k + 1) = «®z(k) + aD v(k). (9.112) 


The performance function Eq. (9.110) and the equations of motion Eq. (9.112) 
define a new problem in optimal control for which the solution is a control law 


v = —Kz, 
which, if we work backward, is 
o*u(k) = —K(a*x(k)) 
or 
u(k) = —Kx(k). (9.113) 


We conclude from all this that if we use the control law Eq. (9.113) in the state 
equations (9.10), then a trajectory results that is optimal for the performance J, 
given by Eq. (9.109). Furthermore, the state trajectory satisfies Eq. (9.111), where 
z(k) is a stable vector so that x(k) must decay ar least as fast as 1/a*, or else 2(k) 
could not be guaranteed to be stable. 

To apply the pincers we need to relate the settling time to the value of o. 
Suppose we define settling time of x, as that time t, such that if x (0) = 1 and all 
other states are zero at k = 0, then the transients in x, are less than 0.01 (1% of 
the maximum) for all times greater than ¢,. If we approximate the transient in x 
as 


x,00 ~ x, (0((1/0)*, 
then when kT = £,, we must have 

x, (kT) < 0.01x,(0), 
which will be satisfied if o is such that 


: | 
1/a)* € 0.01 = —- 
(1/a) < 109" 


or 
a > 100"* = 10075. (9.114) 


In summary, application of the pincer procedure requires that the designer 
select the settling time, fy; within which all states should settle to less than 1%. 
Equation (9.114) is then used to compute o and, according to Eq. (9.112), the 
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revised system &® and aI for use in an LQR computation for the feedback gain 
matrix K. Use of K with the original system, ®, will produce a response of all 
states that settle within the prescribed 1.. 


Paper-Machine Design Example 


As an illustration of a multivariable control using optimal control techniques, we 
will consider control of the paper-machine head box described in Appendix A.5. 
The continuous equations of motion are given by 


—0.2 0.1 1 Oy 1 
x=| -005 0 O |x-| 0 07 Ju (9.115) 
0 0 -l 1 0 


We assume the designer has the following specifications on the responses of this 
system: 


1. The maximum sampling frequency is 5 Hz (T = 0.2 sec). 

2. The 196 settling time to demands on x, (— total head) should be less than 2.4 
sec (12 periods). 

3. The settling time to demands on x,(= liquid level) should be less than 8 sec 
(40 periods). 

4. The units on the states have been selected so that the maximum permissible 
deviation on total head, x,, is 2.0 units and the liquid level, x,, is 1.0. 


5. The units on control have been selected so that the maximum permissible 
deviation on u, (air control) is 5 units and that of u, (stock control) is 10 
units. 


First let us apply the pincer procedure to ensure that the settling times are 
met. We have asked that the settling times be 2.4 sec for x, and 8 sec for x,. If, for 
purposes of illustration, we select the more stringent of these and in Eq. (9.114) 
set t, = 2.4 for which t /T = 12, then 


æ > 100"? = 1.47. 


Now let us select the cost matrices. Based on the specifications and the 
discussion in Section 9.5.1, we can conclude that m, = 2 and m, = 1, and thus 


- [028 0 
à,-| 0 it 


Because we are interested only in x, and x,, the output matrix H in Eq. (9.108) is 


100 
1-|5 10] 
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and 


Furthermore, because Uy max = 5 and u 


Conversion of the continuous system in Eq. (9.115) to a discrete one (7 = 0.2 
sec) yields 


0.9607 0.0196 0.1776 0.0185 0.1974 
m= | —0.0098 0.9999 —0.0009 and PF = | —0.0001 0.1390 
0 0 0.8187 0.1813 0 


These two matrices are then multiplied by the scalar 1.47 (= œ) and used in 
MATLAB's diqr.m with the preceding Q, and Q,. 
The LQR calculation gives a control gain 


k-[58 -979 379 
=| 095 4.94 0.10 |^ 


and closed-loop poles at 
z = 0.108, 0.491 + j0.068, 


which are well within 1/1.47 = 0.68. Fig. 9.9(a) shows the transient response 
to unit-state initial conditions on x,, and Fig. 9.9(b) shows the same for initial 
conditions on x,. Examination of these results shows that the requirement on 
settling time has been substantially exceeded because x, settles to within 1% by 
. 1.8 sec, and x, settles within 1% by 1.6 sec for both cases. However, the control 
effort on u, is larger than its specification for both initial condition cases, and 
further iteration on the design is required. To correct this situation, the designer 
should be led by the fast response to relax the demands on response time in order 
to lower the overall need for control. Figure 9.10 shows the response with ¢, in 
Eq. (9.114) selected to be 5 sec for the case with x(0) — [010]. The control uy 
is just within the specification, and the response time of x, is 2.3 sec and that of 
x, is 2.0 sec. All specifications are now met. 
Itis possible to improve the design still further by noting that the response of 
X, beats its specified maximum value and settling time by a substantial margin. 
To capitalize on this observation, let us try relaxing the cost on x,. After some 
iteration (see Problem 9.7), we find that no cost on x,, that is 


025 0 0 
Q=| 0 oo}, 
0 00 
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Figure 9.9 
Response of 
paper-machine 
closed-loop control to: 
(a) initial state 
x7(0)=[1 0 O]; 
(b) initial state 
x7(0)=[0 1 0] 


Outputs 


Outputs 


(b) Time (sec) 


results in a design that still meets all specifications and substantially reduces 
the use of both controls. Its response to x(0) = [010]’ is shown in Figure 
9.11. 
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Figure 9.10 

Response of 
paper-machine 
closed-loop control to 
initial state 

x7(0)=[0 1 0O]with 
t, lengthened to 5 sec 


Outputs 


Time (sec) 


Figure 9.11 

Response of 
paper-machine 
closed-loop control to 
initial state 

x(0)=[0 1 O]with 
t, lengthened to 5 sec 
and no weight on x, 


Outputs 


—o— a2 
—— X. 
“15 4/5 ] 
—(— 410 
22 — 
0 0.5 1 1.5 2 2.3 3 3.5 4 


Time (sec) 
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Figure 9.12 
Magnetic-tape-drive 
design example 
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Magnetic-Tape-Drive Design Example 


As a further illustration of MIMO design, we will now apply the ideas to an 
example that includes integral control and estimation. The system is shown in 
Fig. 9.12. There is an independently controllable drive motor on each end of 
the tape; therefore, it is possible to control the tape position over the read head, 
X,, as well as the tension in the tape. We have modeled the tape to be a linear 
spring with a small amount of viscous damping. Although the figure shows the 
tape head to the left of the spring, in fact, the springiness is distributed along 
the full length of the tape as shown by the equations below. The goal of the 
control system is to enable commanding the tape to specific positions over the 
read head while maintaining a specified tension in the tape at all times. We will 
carry out the design using MIMO techniques applied to the full system equations 
and conclude the discussion by illustrating how one could perform the design 
using a decoupled model and SISO techniques. 

The specifications are that we wish to provide a step change in position of 
the tape head, x,, of 1 mm with a 1% settling time of 250 msec with overshoot 
less than 20%. Initial and final velocity of x, is to be zero. The tape tension, T,, 
should be controlled to 2 N with the constraint that O < T, < 4 N. The current is 
limited to 1 A at each drive motor. 

The equations of motion of the system are 


J, = -Tr + Kati 
JÖ, = =F i K miz 
i =K (tl) +b (x, =e; 


x, = (x, + x,)/2, (9.116) 
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where 

i,, 4, = current into drive motors ] and 2, respectively (A), 
T, = tension in tape (N), 

0,0, = angular position of motor/capstan assembly (radians), 

X, X, = position of tape at capstan (mm), 
X, = position of tape over read head (mm), 
J = 0.006375 kg — m’, motor and capstan inertia, 
r = 0.1m, capstan radius, 


= 0.544N — m/A, motor torque constant, 
k — 2113N/m, tape spring constant, and 
= 3.75N — sec/m, tape damping constant. 


In order to be able to simulate a control system design on inexpensive equipment 
where the system has time constants much faster than 1 sec, it is often useful 
to time-scale the equations so that the simulation runs slower in the laboratory 
than it would on the actual system." We have chosen to time-scale the equations 
above so they run a factor of 10 slower than the actual system. Therefore, the 
settling time specifications become 2.5 sec instead of 250 msec for the actual 
system. The numerical equations below and all the following discussion pertain 
to the time-scaled system. Incorporating the parameter values and writing the 
time-scaled equations in state form results in 


X, 0 0 —10 0 Xj 
x, 0 0 0 10 x, 
W, 3.315 —3.315 —0.5882 —0.5882 w, 
@, 3.315 —3.315 —0.5882 —0.5882 w, 
0 0 
0 0 E 
iE TER, | i, |: Olt) 
0 8.533 
where the desired outputs are 
Xi 
VO ES 0.5 0.5 0 0 X; 
l ii | : l -2413 2.113 0.375 0.375 || o, | mls) 


where 
@,, €, = angular rates of motor/capstan assembly, 8, (rad/sec). 
The eigenvalues of the system matrix in Eq. (9.117) are 


s =0, 0, +j8rad/sec, 


17 See Franklin, Powell, and Emami-Naeini, 8th Ed (2019) for a discussion of time-scaling. 
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where the oscillatory roots are from the spring—mass system consisting of the tape 
and the motor/capstan inertias. This open-loop resonance has a higher frequency 
than the required closed-loop roots to meet the settling time specifications; there- 
fore, it would be wise to sample at 15x the open-loop resonance at 8 rad/sec. A 
sample rate of T = 0.05 sec results. 


Full State Feedback 


As a first step in the design, let’s try state feedback of all four states in Eq. (9.117). 
We will address the state estimation in a few pages. 

The output quantities to be controlled are x, and T, so that it is logical to 
weight those quantities in the cost Q,. According to the guidelines in Eq. (9.106), 


we should pick 
EN 1 0 
a-|o1] 
1 0 i 


and because we expect each control to be used equally 


1 0 
e-|oi] 


Although these weightings gave an acceptable result, a somewhat more desirable 
result was found by modifying these weightings to 


0 and walg f |: 
l 


where a third output has been added to that of Eq. (9.118), which represents T 
in order to obtain better damping of the tension. Thus we obtain the weighting 
matrix from 


Q, =H/Q,H,. (9.119) 


where 


oS 05 o- 0 
H, =| —2.113 2.113 0375 0375 |. (9.120) 
0 0 05 0.5 


Use of the Q, and Q, with the discrete equivalent of Eq. (9.117) in MATLAB's 
diqr.m results in the feedback gain matrix 


k- —0.823 0.286 1.441 0.311 
~ | —0.286 0.823 0.311 1.441 |’ 
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Figure 9.13 
Reference input 
Structure used for 
Fig. 9.14 


which can be used to form the closed-loop system for evaluation. Calculation of 
the eigenvalues of the discrete system and transformation of them to their discrete 
equivalent by z = e^" results in closed-loop roots at 


s = —5.5 + j5.5, —10.4 + j12.4 sec™'. 


To bring the system to the desired values of T, and x,, we use the state com- 
mand structure described in Section 8.4.2. Let us first try without the feedforward 
value of steady-state control, u,.. Calculation of the reference state, x, can be 
carried out from the physical relationships that must exist at the desired values 
of x, = | mm and 7, 2 2N 


X, tx, =2x,=2, 
T 2 


X,—2X,-— > SIS 0.000947 m = 0.947 mm. 
Solving these two equations and adding the zero velocities results in 
0.527 
— 1.473 
P 0 
0 


Evaluation of the time response using the structure in Fig. 9.13 (from 
Fig. 8.14) shows that the input current limits are violated and results in a settling 
time of 1 sec, which is significantly faster than required. In order to slow down 
the response and thus reduce the control usage, we should lower Q,. Revising 
the weightings to 


c dpi dedi T 
Q,=| 0 01 O | and 2 | (9.121) 
0 0 O1 


results in 


K= —0.210 0.018 0.744 0.074 
^ | —0.018 0.210 0.074 0.744 |’ 


Figure 9.14 
Time response using 
Eq. (9.121) 
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which produces a closed-loop system whose roots transform to 
s = —3.] + j3.1, 24.5 + j92 sec ^, 


and produces the response shown in Fig. 9.14. The figure shows that x, has a 
settling time of about 2 sec with an overshoot less than 20% while the control 
currents are within their limits; however, the tension did not go to the desired 
2 N. The steady-state tension error of the system is substantial, and so we shall 
proceed to correct it. 

As discussed in Section 8.4.2, we can provide a feedforward of the required 
steady-state value of the control input that eliminates steady-state errors. Eval- 
uation of Eq. (8.73) (see refi.m in the Digital Control Toolbox) using H from 
Eq. (9.118) and 6, T from Eq. (9.117) leads to 


| —0.2366 
| +0.2366 _ [0 0.1839 

Meo ag a N= [5 3] 
0 0 


which can be used to command the system as in Fig. 8.15(a) with 
r=[(Imm 2N)’. Note that N,r — x, = [0.527 1.473 0 707 as al- 
ready computed above, and we now have the steady state control, u,, = N,r = 
[0.368 0.368 J”, that should eliminate the tension error. The response of the 
system with this modification is shown in Fig. 9.15. It verifies the elimination of 
the tension error and that all other specifications are met. 


2.5 em E PU nn 


Outputs 
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Figure 9.15 

Time response using 
Eq. (9.121) and 
feedforward of u.. 


0 0.5 l 1.5 2 2.5 3 


Time (sec) 


Normally, designers prefer to avoid the use of feedforward of u,, alone 
because its value depends on accurate knowledge of the plant gain. The preferred 
method of reducing steady-state errors is to use integral control or possibly 
to combine integral control with feedforward. In the latter case, the integral’s 
function is simply to provide a correction for a possible error in the feedforward 
and thus can be designed to have less of an impact on the system bandwidth. So 
let's design an integral control to replace the u,. feedforward. 

Because the tension was the only quantity with the error, the integral control 
need only correct that. Therefore, we will integrate a measurement of the tension 
and use it to augment the feedback. We start out by augmenting the state with the 
integral of T, according to Eq. (8.83) so that 


E Jl -[5 az «e fae, (9.122) 
+i k 


where, from Eq. (9.118) 
H,-—I-2T1$ Ag 0375 0375). 


We will use the augmented system matrices (b, I) from Eq. (9.122) in an LQR 
computation with a revised Q, that also weights the integral state variable. This is 
most easily accomplished by forming a revised H „ from Eq. (9.120) that includes 
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a fourth output consisting of the tension integral, x,, and that is used with the 
augmented state defined by Eq. (9.122). The result is 


0 05 0.5 0 0 

— |. AO SS 2eL 213; 0:375 0.375 
E 0 0 0 0.5 05 
] 0 0 0 0 


(9.123) 


The revised weighting matrices are then obtained by the use of Eq. (9.119) with 
an additional diagonal term in Q, corresponding to an equal weight on x,, that is 


Only gi d 0 

=~ 0 01 D 0 1 0 

Q, = 0 0-0 0 and Q, = l 0 1 |: (9.124) 
D x» O 04d 


Performing the calculations results in an augmented feedback gain matrix 


K Ss —0.926 0.734 1.26 126 | 


0.137 —0.734 0.926 0.585 1.26 (9.125) 


which can be partitioned into two gains, one for the integral and one for the 
original state 


0.137 —0.926 0.734 1.26 0.585 
-| | xs E 0.926 0.585 1.26 li 112e 


These feedback gain matrices are then used to control the system as shown 
in Fig. 9.16, which is based on the ideas from Fig. 8.23. Essentially, the only 
difference between this implementation and the previous one is that we have 
replaced the u,, feedforward with an integral of the tension error. 

The roots of the closed-loop system transformed to the s-plane are 


s = —9.3, —3.1 + j3.1, 25.8 j 11.8 sec™!, 


and the time response of the system is shown in Fig. 9.17. Again, the tension 
goes to the desired 2 N, as it should, and all other specifications are met. 


Add the Estimator 


The only unfinished business now is to replace the full state feedback with an 
estimated state. We will assume there are two measurements, the tension, T,, and 
tape position, x,. Although measurements of 0, and 0, would possibly be easier 
to make, it is typically best to measure directly the quantities that one is interested 
in controlling. Therefore, for purposes of estimation, our measurement matrix is 


0.5 0.5 0 0 
"ges | —2.113 2.113 0.375 0.375 | (9.127) 
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Figure 9.16 

Integral control 
implementation with 
reference input and full 
state feedback 


=> x(k + 1) = xk) + T, GO) - T,(k) 


Figure 9.17 
Time response using Eq. (9.126) and integral control as in Fig. 9.16 


Time (sec) 
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Let us suppose that the rms accuracy of the tape position measurement is 
0.02 mm. According to Eq. (9.104), the first diagonal element of the R, matrix 
is therefore 0.02? = 0.0004. Let us also suppose that the rms accuracy of the 
tension measurement is 0.01 N. This results in 


0.000 0 
R, =| 0 ae, | ti 


Determining values for the process-noise matrix, R,,, is typically not based 
on the same degree of certainty as that possible for R,. We will assume that 
the process noise enters the system identically to i, and i; therefore, D, =P in 
Eq. (9.81). We could make measurements of the magnitude and spectrum of the 
input-current noise to determine R, via Eqs. (9.101) and (9.102). However, other 
disturbances and modeling errors typically also affect the system and would need 
to be quantified in order to arrive at a truly optimal estimator. Instead, we will 
somewhat arbitrarily pick 


0000 0 
&, =| 0 goa. | e. 


compute the estimator gains and resulting roots, then modify R, based on the 
estimator performance in simulations including measurement noise and plausible 
disturbances. Proceeding, we use the unaugmented ® and I’, based on Eq. (9.117) 
in MATLAB's kalman.m with H, from Eq. (9.127) and the noise matrices from 
Eqs. (9.128) and (9.129) to find that 


0.321 —0.120 
0.331 0.120 

L= _9124 0.142 |* (9.130) 
0.124 0.142 


which results in estimator error roots that transform to 
s = —30 + j4.5, —4.1 4715.4 sec"! . (9.131) 


If implemented using the state command structure as in Fig. 9.18, which has the 
same reference input structure as in Fig. 9.16, the response will be identical to 
Fig. 9.17. This follows because the estimator model receives the same control 
input that the plant does and thus no estimator error is excited. In order to evaluate 
the estimator, we need to add measurement noise and/or some plant disturbance 
that is not seen by the estimator. Figure 9.19 shows the system response to the 
same reference input as Fig. 9.17; but zero mean random noise with an rms of 
0.02 mm has been added to the x, measurement, noise with an rms of 0.01 N has 
been added to the fis measurement, and a step disturbance of 0.01 A has been 
added to the /, entering the plant’? at 1.5 sec. Note in the figure that there is 


18 Only the plant, not the estimator. 


416 Chapter9 Multivariable and Optimal Control 


Figure 9.18 
Integral control implementation with reference input and estimated state feedback 


Figure 9.19 

Time response with 
integral control and an 
estimator with L from 
Eq. (9.130) including 
measurement noise and 
a disturbance at 1.5 sec 
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some noise on T, and the input currents, and that the disturbance on i, caused a 
steady-state error in x,. 

This result can be modified by changing the estimator. Let us suppose that 
we wish to reduce the steady-state error on x, and we can tolerate increased noise 
effects. So let us increase the assumption for the process noise Eq. (9.129) to 


0.001 0 
&,-[ 0 odor: 


By increasing the process noise, we are essentially saying that the knowledge 
of the plant model is less precise; therefore, the optimal filter will pay more 
attention to the measurements by increasing the estimator gains and speeding up 
the estimator. Higher estimator gains should reduce the steady-state estimation 
errors and their effect on x,, but at the cost of increased sensitivity to measurement 
noise. Carrying out the calculations yields 


0.497 —0.143 
0.497 0.143 

L=| _9338 039 |' (2292) 
0.338 0.329 


which is larger in every term than the previous gain in Eq. (9.130). Using the 
gain to form the estimator error equation and transforming its roots to the s-plane 
results in 


s = —40 + j82, —2.1 + j20.6 sec’, 


which are faster than the previous roots in Eq. (9.131). Evaluation of the same 
case as in Fig. 9.19 with everything identical except for the estimator gains 
results in Fig. 9.20. Note that the actual magnitude of the measurement noise 
and disturbance was not changed in generating the figure. The value of R,, was 
changed in order to produce higher values in the gain matrix. The figure shows the 
expected result: larger sensitivity to the measurement noise but reduced sensitivity 
to the disturbance. 


Decoupled Design 


As a final comment, we note that the plant can be uncoupled and designed as 
two independent second-order systems, one for x, and one for T,. The key idea 
to accomplish the uncoupling is to note that equal commands on the two current 
inputs will cause an increase in tension with no change in x,, whereas equal and 


19 We could also add a second integral control loop on x, to kill the error, but it is preferable to reduce 
errors with high control and estimator gains in order to have a robust system for all kinds of errors, if 
the gains can be tolerated. 
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Figure 9.20 


Time response with integral control and an estimator with L from Eq. (9.132) including 
measurement noise and a disturbance at 1.5 sec 


Time (sec) 


opposite current commands will cause a change in x, with no effect on tension. 
We therefore transform the control accordingly 


-11][i 23 
yal elea ill] x- [3l (9.133) 


We also define a new state 


X, H, 
X; = T = EUR x = Tx, (9.134) 
T HF 


where H, and H, are the partitions of H, in Eq. (9.127) and F is the system matrix 
from Eq. (9.117). Following the state transformation ideas in Section 4.3.3, we 
can write that 


X, = Fx, + G,u,, (9.135) 
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where 
F,=T,FT;' and G =T GTI; (9.136) 
Performing these calculations yields 
0 1 0 0 0 0 
me te oe || titel E 
0 0 —66.3 —1.18 0 176.5 


(9.137) 
Therefore, we see that there is a decoupling that results in two separate systems, 


X 0 1 x 0 
DERE 


7, 0 l T, 3.2 
| i || estes zs || zll AR (9.139) 


e 


and 


The two system equations, (9.138) and (9.139), can then be used to design 
two control systems using SISO methods. The resulting controls, u, and u,, can 
then be “unscrambled” via T, to arrive at the desired i, and i5, that is, 


[jamie] 


The perfect uncoupling was due to the perfect symmetry of the problem: mo- 
tors and capstan inertias identical, motor constants identical, and tape read head 
exactly in the middle of the capstans. In an actual system, there would be differ- 
ences and some coupling would be present. Due to the simplicity of the uncoupled 
design and its implementation, however, it can be worthwhile to evaluate the per- 
formance of a decoupled design to see if its errors warranted the full MIMO 
design. 


9.6 Summary 


e An optimal controller is one that minimizes a quadratic cost function where 
weighting matrices are selected by the designer to achieve a “good” design. 
Criteria for evaluation of the design are based on all the same factors that 
have been discussed throughout the book. 

e Optimal control methods are useful in designing SISO systems because they 
help the designer place the pole locations. 

e Optimal control methods are essential in designing MIMO systems as they 
reduce the degrees of freedom in the design iteration down to a manageable 
few. 
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* Generally, for linear, stationary systems, the steady-state optimal control 
solution is satisfactory and far more practical than the true optimal time- 
varying gain solution. 

* The most accurate discrete emulation method is based on finding the discrete 
equivalent of a continuous cost function and solving for the discrete optimal 
control system with that cost. 


e An optimal estimator is based on achieving the minimum mean square esti- 
mation error for a given level of process and measurement noise acting on 
the system. 


* Generally, for linear, stationary systems, the steady-state optimal estimator 
is satisfactory and far more practical than the true optimal time-varying gain 
solution (Kalman filter). 


* Optimal estimators are useful for SISO system design and essential for 
MIMO system design. 


e The optimization procedures provide a technique for which many good de- 
sign candidates can be generated which can then be evaluated to ascertain 
whether they meet all the design goals. The procedures do not eliminate 
trial-and-error methods; rather, they transfer iteration on classical compensa- 
tion parameters to iteration on optimal cost-function parameters or assumed 
system noise characteristics. 


9.7 Problems 


9.1 Optimal control derivation: 
(a) Derive Eq. (9.33) and hence verify the control Hamiltonian as given by Eq. (9.34). 


(b) Demonstrate that if W is a transformation that brings H, to diagonal form, then the 
first n columns of W are eigenvectors of H, associated with stable eigenvalues. 


(c) Demonstrate that if the optimal steady-state control, K,,, given by Eq. (9.47) is 
used in the plant equation Eq. (9.10), then the matrix in the upper left corner of W, 
which is X,, has columns that are eigenvectors of the closed-loop system. 


9.2 Symmetric root locus: 


(a) Compute the closed-loop pole locations of the satellite design problem, using the 
steady-state gains given by Fig. 9.3. 


(b) Take the discrete transfer function from u to 0 for the satellite design problem 
(Appendix A) and form the symmetric root locus. Plot the locus and locate the 
values of p corresponding to the selections of Q, given in Eq. (9.30). 


(c) Show that fora system with onecontrol it is always possible to construct a symmetric 
root locus corresponding to the optimal steady-state control. (Hint: Show that if 
Q = HH" and G(z) is a column-matrix transfer function given by H(zlI — $)r, 
then the roots of the optimal control are given by | + pG’ (z7!)G(z) = 0, which 
is a scalar symmetric root-locus problem and which can be put in the form 
1 + pG, (z^ )G, (z) = 0. Use Eq. (9.38).] 
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(d) Give the equivalent scalar plant transfer function G, (z), if we have the satellite 
control problem and use Q, — I, the 2 x 2 identity. Draw the symmetric root locus 
for this case. 


9.3 Compute the location of the closed-loop poles of the optimal filter for the satellite problem 
for each of the values of R, given by the steady final values of the curves of Fig. 9.8. 


9.4 For thc antenna example in Appendix A.2, 


(a) Design an optimal controller corresponding to the cost function J (y, u) = u? + py? 
for p = 0.01, 0.1, and 1. Use T = 0.2 sec. Plot the step response of the closed 
loop system for initial errors in y and y in each case. Which valuc of p most nearly 
meets the step-response specification given in Chapter 7? 


(b) Draw the symmetric loot locus corresponding to the design of part (a). 


(c) Design an optimal steady-state filter for the antenna. Assume that the receiver noise 
has a variance R, = 107Órad? and that the wind gust noise, w,,is white, and we 


want three cases corresponding to R ppsa = 1072, 1074, and 107. 
psd 


(d) Plot the step response of the complete system with control law corresponding to 


p = 0.1 and filter corresponding to Rega 2107. 
9.5 The lateral equations of motion for the B-767 in cruise are approximately 
ù —0.0868 | —1 —0.0391 0 v 
a e 2.4 —0.228 0 —0.0204 r 
OS 0 0 0 l 9, 
P —4.41 0.334 0 —1.181 p 
0.0222 0 
—1.165 —0.065 ô, 
A E al 
0.549 —2.11 


(a) Design two second-order controllers by ignoring the cross-coupling terms. Pick 
roots at s = —1 + j1.5 rad/sec for the yaw mode (v and 7) and s = —2, —0.1 for 
the roll mode (6, and p). Use a sample period of T = 0.05 sec. 


(b) Determine the root locations that result when the cross-coupling terms are included. 
(Use eig.m in MATLAB.) 


(c) Assuming one can measure r and @ , design two second-order estimators by 
ignoring the cross-coupling terms. Place all poles at s = —2 rad/sec. 


(d) Determine the root locations that result from (c) when the cross-coupling terms are 
included. 


9.6 The equations of motion for a stick balancer (Fig. 9.2) are given by 


6 o UT w W 8 0 
&| 200 os » 29 
S SS PAREDES C be! Dep NA ER 
ù o 5: 0 9 |l 2 1 


RPEN 


exeo 
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Figure 9.21 

Disk drive closed-loop 
system for Problems 
9.11 and 9.12 


9.7 


9.8 


9.9 


9.10 


9.11 


Design two second-order estimators, one for the stick (0, w) and one for the cart (x, v). 
Verify that the one-way coupling (cartto stick only)causes the full fourth-order estimator 
to have the same roots as the separate second-order designs. 


Starting with the paper-machine design in Section 9.5.3 whose output is shown in 
Fig. 9.11, perform design iterations on Q,, Q,, and £, to find the lowest possible value 
of u, that still meets the state specifications on maximum value and settling time. [Hint: 
The routines FIG99.M, FIG910.M and FIG911.M from the Digital Controi Toolbox will 
be useful to you.] 


For the double mass-spring system in Example 9.5, investigate the effect of different Q's 
on the closed-loop poles of the system. In particular, determine a Q that will yield an 
equivalent damping of the oscillatory poles that is greater than ¢ = 0.5. [Hint: FIG95.M 
from the Digital Control Toolbox will be useful to you.] 


For the double mass-spring system in Example 9.5, design a controller and estimator that 
uses a measurement of d. It should respond to a command input for d with a rise time of 
2 sec with an overshoot less than 20%. 


For the double mass-spring system in Example 9.5, design a controller and estimator that 
uses a measurement of d. It should respond to a command input for d with a rise time of 
3 sec with an overshoot less than 15%. Plot the frequency response of your compensation 
(control plus estimator) and qualitatively describe the features of the compensation. 
A simplified model of a disk drive is 
E 1059 
zi s? (s? + 260,5 +02)’ 
where w, = 19,000 rad/sec (=3 KHz) and¢ = 0.05. Thisis the transfer function between 
the torque applied to the disk head arm and the motion of the read head on the end of 
the arm. Note this is also the same generic model as the double mass-spring system in 
Eq. (A.20). The system has the capability to measure where the heads are with respect 
to the data tracks on the disk. The two biggest disturbances are due to an actuator bias 
(w,) and track wobble (y,) from an off center disk mounting. A schematic diagram of 
the system to be controlled is shown in Fig. 9.21. 

Since the disk is rotating at 3822 rpm (400 rad/sec), an offset causes a sinusoidal 
y, to be added to the output of the transfer function above. The purpose of the servo is 
to follow the data tracks as closely as possible, whether the disk is slightly off center or 
not because that allows higher data densities on the disk. Itis also desirable for the servo 
to settle on a new track as quickly as possible because the disk access time is directly 
related to this settling time and is a major selling point for disk drives. 

Design a discrete compensator with a sample rate no faster than 8 kHz that has an 
overshoot to a step command no greater than 20%, does not allow steady state tracking 


G(s) 


9.12 


9.13 


9.14 


9.15 
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errors due to the actuator bias, and has a settling time (to within 5% of the final value) 
to a step input less than 5 msec. The system should be robust with respect to variations 
in the resonance frequency as this can change by +2% due to temperature changes. Plot 
the closed-loop frequency response of the error, e, in order to evaluate how well your 
system attenuates the disturbance response in the vicinity of 400 rad/sec (3822 rpm). 


Design a compensator for the disk drive servo specified in Problem 9.11 so that it provides 
the minimum possible error due to the disturbance at 3822 rpm. Since the spin rate drifts 
some, define the error to be the average of that from: the nominal spin rate of 3822 rpm, 
+1% of the nominal, and —1% of the nominal. (Hint: Review Section 8.5.) 


For 
G(s) 
s)= 5, 
$ 


use optimal control and estimation to design compensation that provides a 10 msec rise 
time with less than a 10% overshoot to a command. Determine the closed-loop system 
bandwidth and the system phase margin. 


The MATLAB code below will generate some noisy data from an oscillator driven by a 
square wave 


F=[0 1;-1 0];G=[0;1];H=[1 0]}:J=0;T=0.5; 

sysC = ss(F,G,H,J) 

sysD=c2d(sysC,T) 

Un=[ones( 1,10) zeros(1,10) ones(1,10) zeros(1,11)]; 
U=Un+0.1*randn(size(Un)); 

Ynzlsim(sysD,U); 

Ym=Yn+0.2*randn(size(Yn)). 


The measurement noise is zero mean and normally distributed with R, = 0.2? while the 
process noise is zero mean and normally distributed with R,, = 0.1. Assume the initial 
value of the state has a covariance matrix 


and use a time-varying Kalman filter to reconstruct the state. Compare the resulting time 
history of the optimal state estimate with that from a constant gain Kalman filter. [Hint: 
FIG98.M from the Digital Control Toolbox will be useful to you.] 

Repeat Example 9.7, but fit the best straight line to the data. [Hint: FIG97.M from the 
Digital Control Toolbox will be useful to you.] 


 IO- 


Quantization Effects 


A Perspective on Quantization 


In Chapter 13 we will consider the analysis of systems having general nonlinear- 
ities; but first, in this chapter, we will consider the special nonlinearity of digital 
control: Numbers in the computer must be forced to fit in digital words that are 
defined by a finite number of bits, usually 8, 16, 32, or 64 bits. Thus far we 
have considered the use of the digital computer as a linear, discrete-time device 
for implementing control designs. Now we tum to consideration of the fact that 
numbers in the computer are taken in, stored, calculated, and put out with finite 
accuracy. The technique of representing a real number by a digital value of finite 
accuracy affects digital control in two principal ways. First, the variable values 
such as e, u, and the internal state variables, x, used in the difference equations are 
not exact, and thus errors are introduced into the output of the computer. As we 
shall see, these errors can often be analyzed as if there were noise sources in the 
computer. Second, the coefficients such as a, and b, of the difference equations, 
which must be stored in the computer, cannot be arbitrary real numbers but must 
be realizable by a finite number of bits. This can cause the machine to solve a 
slightly different equation than it was designed to do and has the potential to result 
in instability. Our purpose in this chapter is to explore methods that can be used 
for the analysis of these two effects: quantization of variables and quantization 
of coefficient parameters. The effect of quantization when the microprocessor 
has 32 bits is typically not noticeable. However, some products are very cost 
sensitive and it is desirable to use an 8 bit computer, if possible. The material in 
this chapter is primarily directed toward the design of such systems. 


Chapter Overview 


The random error model for the round-off process is first developed in Sec- 
tion 10.1. This section also develops several analysis methods for such random 
processes that are useful for quantization and then applies the random process 
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fixed point 


floating point 


quantization 


round-off 


10.1 


analysis methods from Chapter 9 to the quantization case. Section 10.2 examines 
the deterministic problem of parameter storage quantization errors and demon- 
strates the sensitivity of the errors to the structure of the difference equation 
mechanization. The last section, 10.3, describes some of the consequences of 
round-off errors and what steps can be taken to alleviate them. 


Analysis of Round-Off Error 


In our first analysis of finite accuracy we will assume that the computer repre- 
sents each number with a fixed location for the equivalent of the decimal point, 
or fixed point representation. Although for any given number the point is fixed, 
considerations of amplitude scaling and dynamic range often require that differ- 
ent numbers in the same program have their decimal points in different locations. 
The consequence is that different magnitudes of errors are introduced at differ- 
ent locations as the results of calculations being fitted into the proper computer 
number format. Fixed point representation is typical in real-time control com- 
puters; however, in scientific calculations done in higher-level languages such as 
BASIC, MATLAB, or C, floating point representations are mainly used, wherein 
the number representation has both a mantissa (magnitude and sign information) 
and an exponent that causes the location of the decimal point to float. With fixed- 
point arithmetic, addition and subtraction are done without error except that the 
sum can overflow the limits of the representation. Overflow must be avoided by 
proper amplitude scaling or analyzed as a major nonlinearity. For control- or 
signal-processing applications, the logic is arranged so that the result of overflow 
is a saturation effect, a nonlinearity that will be treated in Chapter 13. In the case 
of multiplication, however, a double-length product is produced and the machine 
must reduce the number of bits in the product to fit it into the standard word size, 
a process we generally call quantization. One way to perform quantization is by 
ignoring the least significant half of the product, a process called truncation. If 
we assume that the numbers are represented with base 2 and that £ binary digits 
(bits) are to the right of the point (for the fractional part), the least significant 
bit kept represents the magnitude 27. The result of truncation will be accurate 
to this value (the part thrown away could be almost as large as 2 *). Thus a plot 
of the "true" value of a variable x versus the quantized value, x would look 
like Fig. 10.1(a), where the error (shown in (b)) is decided by the quantum size, 
q (which is 27* under the conditions mentioned above). It is common practice 
in control computers to use round-off rather than truncation. With round-off, 
the result is the same as truncation if the first bit lost is a 0, but the result is 
increased by 2~ if the first bit lost is a 1. The process is the same as is common 
with ordinary base 10 numbers where, for example, 5.125 is rounded to 5.13, but 
5.124 becomes 5.12 to two (decimal) places of accuracy. An input—output plot 
of rounding is shown in Fig. 10.1(c), and the corresponding error is shown in 


Figure 10.1 
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Plot of effects of number truncation. (a) Plot of variable versus truncated values. (b) Plot 
of error due to truncation. (c) Plot of variable versus rounded values. (d) Round-off error 


q 
x 
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€ 
Q 
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10.1(d).' Notice that the maximum error with roundoff is half that resulting from 
truncation. The value of £ in a particular case depends on the word size of the 
particular computer in use. For control implementations the choice is based on 
the required accuracy and dynamic range and is limited by the expense. At this 
time (1997), microprocessors are readily available with word sizes of 8, 16, and 
32 bits, with a cost premium on the larger sizes. One of the goals of this chapter 
is to analyze the effect of £ and thus of word size on the stability and on the 
errors due to quantization's effects so the designer can select the smallest word 
size consistent with the required performance. 
We can represent either truncation or round-off by the equation 


x-x-ce, (10.1) 


| The plots in Fig. 10.1 assume that the numbers arc represented in two's complement code. 
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quantization models 


Bertram bound 


Figure 10.2 

A linear system and the 
introduction of one 
source of round-off 
errors 


where e€ is the error caused by the truncation or round-off of x into the digital 
representation x. Because the error due to truncation equals a constant plus 
round-off error, we will assume round-off in our analysis unless otherwise stated 
and assume that analysis of the effects of the additional constant bias when 
truncation is used (which is rare) can be treated separately. Clearly the process of 
analog-to-digital conversion also introduces a similar effect, although often with 
still another value for g than those resulting from round-off during arithmetic in 
the microprocessor. 

The analysis of the effects of round-off depends on the model we take for 
€. We will analyze three such models, which we can classify as (a) worst case, 
for which we will bound the error due to round-off; (b) steady-state worst case, 
for which we will compute the largest output possible if the system reaches a 
constant steady state; and (c) stochastic, for which we will present a model of the 
round-off error as a random process and compute the root-mean-square of the 
output error due to round-off. 


The Worst-Case Error Bound 


The worst-case analysis is due to Bertram (1958). His analysis takes the pes- 
simistic view that the round-off occurs in such a way as to cause the maximum 
harm, and his analysis bounds the maximum error that can possibly result from 
round-off. First we consider the case shown in Fig. 10.2, in which a single case 
of quantization is assumed to occur somewhere in an otherwise linear constant 
system. 

There is a transfer function from the point of round-off to the output, which 
we will call H, (z), and we can describe the situation by the equations 


Y(z) = H(z)U(z), 
Y(z) = H(2U(z) — H (z)E (z; x), 


Y-P=Y=A,(2)E,(z; x). (10.2) 
u(k) yk) 
U(z) Y(z) 


u(k) 
U(z) 
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We write E, as a function of the state variable x to emphasize that Eq. (10.2) is 
not linear because we do not know how to compute E, until we have the values 
of x. However, we are not looking for the exact value of Y but an upper bound 
on the time error, y(k). 

Because we wish to find a bound on the time signal y(k), we need the time 
domain equivalent of Eq. (10.2), which is the convolution sum given by 


y(n) = h(k)e,(n — k; x). (10.3) 
k=0 
If we examine Fig. 10.1(d), we can see that whatever the exact values of e, 
may be, its magnitude is bounded by q,/2, where q, is the quantum value for 
the operation which introduces the quantization being analyzed. We can use this 
information to determine in short steps the bound on y as we did in Chapter 4 
in the discussion of BIBO (bounded input, bounded output) stability. Taking the 
magnitudes of both sides on Eq. (10.3), we get 


BiG 


Because the sum is bounded by the sum of the magnitudes of each term, we have 
the inequality 


FE 


n 
s hel 
n 


which is the same as 


< Mel: 
0 


but by Fig. 10.1(d), the error magnitude is always less than q,/2, so the output 
error is bounded by 


n q 
< 2 


Finally, the sum can only get larger if we increase the number of terms 
— og 
CES h |2. 10.4 
ly(n)| < i | l5 (10.4) 


Equation (10.4) is Bertram's worst-case bound. The function qwc.m computes 
it. By comparison with the condition for BIBO stability, we can conclude that if 
the linear system is BIBO stable in response to inputs applied at the point of the 
quantization, then introduction of the quantization will not cause the system to be 


430 Chapter 10 Quantization Effects 


unstable in the BIBO sense.? As we will see later, the system with quantization 
can have an output error that is nonzero either as a constant or as an oscillation; 
so the system may not be asymptotically stable but the output error will not grow 
beyond the bound given by Eq. (10.4). 


€ Example 10.1 


Bertram Bound Application 
For the first-order system 

ylk +1) 2 ay(k) + ulk), (10.5) 
compute Bertram’s bound, assuming that the round-off occurs in the computation of the 
product oy. 
Solution. Using Eq. (10.4), we have 

HK 41) = o0) + elk) + uk), (10.6) 

and thus 

yk + 1) = ay(k) — €(k). (10.7) 


For the error system, the unit pulse response is h, (k) = o* and Eq. (10.4) is easily computed 
as follows for Ja| < 1 


"EL SS 1 
HUER es (10.8) 


* 


For this example, if o > 0, the error due to quantization at any particular 
time is bounded by the DC gain from the quantizer to the output times 9/2. This 
bound is only likely to be approached when a system has a constant input and 
has settled to its steady-state value. 


The Steady-State Worst Case 


The steady-state worst case was analyzed by Slaughter (1964) in the context 
of digital control, and by Blackman (1965) in the context of digital filters. For 
this analysis, we view the round-off to cause some stable, transient errors of no 
special concern, and we assume that all variables eventually, in the steady state, 
become constants. We wish to know how large this steady-state error can be as a 
result of round-off. We consider again the situation shown in Fig. 10.2 and thus 
Eq. (10.3). In this case, however, we assume that Eq. (10.3) reaches a steady state, 


3 We assume that there is no saturation of the quantizer during the transient. 
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at which time e is constant and in the range —q/2 < e,, < q/2. Then Eq. (10.3) 
reduces to 


y,(09) = hoe, 
0 


The worst steady-state error is the magnitude of this signal with €,, = q/2, 
which is 


I5, (co)| < 2: (10.9) 


Y h.0) 
0 


There is one nice thing about Eq. (10.9): The sum is the value of the transfer 
function H,(z) atz = 1, and we can write? 


I5. (09) < LAS (10.10) 


€ Example 10.2 


Worst Case Steady-State Bound 
Use the worst case analysis to compute the error bound for the first-order system given by 
Eq. (10.5). 
Solution. In this case, the transfer function is 
H,(2) = MG — a). 


From Eq. (10.10), we find that the bound is 


= q 
= k l1 
AEE (10.11) 


This is the same as the Bertram worst-case bound if a > 0. 


+ 


So we see that this steady-state worst case yields the general result that the 
error is bounded by q /2x dc gain from the quantization source to the output. 

Equations (10.4) and (10.9) or (10.10) express the consequences of round- 
off from one source. For multiple sources, these equations can be extended in an 


4 A function to compute the quantization steady-state error is given by qss.m. 
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obvious way. For example, if we have K sources of round-off, each with possibly 
different quantization levels g,, then Eq. (10.4) becomes 


3 a [ee 4, NV B o 
MES [Zr + 2, 001 + | 


K oo q, 
< hale. (10.12) 


If all the quanta should be equal, the error bound is 


DE Smot one] 2 


n=0 n=0 


And likewise for multiple sources, Eq. (10.10) is extended to 
b, (69)I s AOE HOI + LIIS. (10.13) 


It is easily possible to express Eq. (10.10) in terms of a state-variable formu- 
lation of the equations of motion. Suppose the equations from the point of the 
quantization to the output are given by 

x(k + 1) = ®x(k) + Tye, (k), 
y(k) = Hx(k) + Je (k). (10.14) 
The assumptions of the steady-state worst-case error are that x(k + 1) = x(k) = 
x,, and e, (k) = e, ... Then 


x, = Ox, +16...  y-Hx.-Je,.. (10.15) 
Solving for y, we find 
5 = [HI — DIT, + Je, ,.. 
which is bounded by 
[5] < I[HIL — ^T, + nid. (10.16) 


The major advantage of the steady-state result is the vastly simpler forms of 
Eqs. (10.13) and (10.16) as compared to (10.12). Unfortunately, Eq. (10.13) does 
not always hold because of the assumption of a constant quantization error of 
4/2 in the steady state,° and yet the worst-case upper bound given by Eq. (10.12) 
is often excessively pessimistic. However, in some cases there is great expense 


5 Note that Eqs. (10.13) and (10.16) are nor bounds on the error. They do, however, give a valuable 
estimate on error size, which is easy to compute for complex systems. 


quantization variance 
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due to distortion of the data should a signal overflow, and the absolute bound 
can be used to select q so that no overflow is possible. An example of the use of 
Bertram’s bound in the design of a spectrum analyzer is given in Schmidt (1978). 


Stochastic Analysis of Round-Off Error 


The third model for round-off error is that of a stochastic variable. The analysis 
will follow Widrow (1956) and has two parts: development of a stochastic model 
for e(k) and analysis of the response of a linear system to a stochastic process 
that has the given characteristics. Because the development of the model requires 
use of somewhat sophisticated concepts of stochastic processes, and because 
the model can be given a very reasonable heuristic justification without this 
mathematical apparatus, we develop the model heuristically and proceed with 
an analysis of the response. A review of the necessary facts from the theory of 
probability and stochastic processes is to be found in Appendix D. 

First, then, we give a heuristic argument for a stochastic model of round- 
off error. We begin with examination of Fig. 10.1, where we have a plot of the 
output versus the input of the round-off operation and a sketch of error versus 
amplitude of the input signal x. If we imagine that x(n) is a random variable 
that takes values at successive sampling instants in a scattered way across the 
scale of values, then it seems reasonable to suppose that the sequence of errors 
€(n) would be scattered over the entire range of possible values, which is to say, 
over the range from —q/2 to q/2. Furthermore, because the "teeth" in the saw 
like plot of € versus x are linear and contain no flat places that would signal a 
preference for one value of e over another, it seems reasonable that the values of 
€ (n) are equally likely to be anywhere in the range q/2 < € < q/2. Furthermore, 
if the signal into the quantizer typically moves several quanta during one sample 
period, it seems reasonable to assume that the values of error at one sample time 
will not be correlated with errors at other times; that is, its spectrum would be 
expected to be flat, which we characterize as "white." The reflection of this 
argument in terms of stochastic processes is to assert that we can model €(k) as 
a white random process having a uniform probability density from —4/2 to q/2. 
A plot of the uniform density is shown in Fig. 10.3. 

From this density we can immediately compute the mean and variance of € 
as follows 


3/2 


= 1 
n - Ele) = | spado | § ag = 


6 If the input to the quantizer is a deterministic signal such as a square wave, a constant, or a sine wave, 
this argument is clearly wrong. It is also approximate if the input is a random signal with strong 
correlation over a timc equal to a sampling period. A careful analysis is given in Clavier, et al. (1947). 
If the input is a sine wave, the error spectrum typically has a number of large spikes, far from flat or 
white. 
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Figure 10.3 
Plot of the uniform 
density function 


covariance propagation 
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| Ye I ae | ] 
=) -2-iG2»- Cam 
25] 3 (4/ 3 (al 
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Thus we assume the following white-noise model for €(”) for the case of 
round-off error as sketched in Fig. 10.1(d) 


e(n) = w(n), (10.19) 
where w(n) has the autocorrelation function (defined in Appendix D) 
R(n) = €tw(k)w(k t- n) = 97/12 (n = 0) 
=0 (n +0), (10.20) 


and the mean value u, is zero. 

With the model given by Eqs. (10.19) and (10.20) we can compute the mean 
and variance of the system error due to round-off using state-space methods. 
Again, we observe that analysis of the response to any constant nonzero mean 
u, can be computed separately from the response to the white-noise component 
w(n). 

In order to analyze the effect of the random part of the quantization noise, 
let's review (see also Section 9.5) the general problem of finding the effect of 
zero-mean noise, w, on a linear system 


x(k + 1) =P x(k) +P wk) 


y(k) = H, x(k) + J wk). (10.21) 
We define the covariance of the state as 
R (k) = E{x(k)x7(k)}, (10.22) 


and, atk +1, 
R,(k +1) = € {x(k + Dx" (k + 1)] 
= €{(®,x(k) +r wk) (P x(k) +r wh)"} 
=P R (DO, +r ROF. (10.23) 


Lyapunov equation 


10.1 Analysis of Round-Off Error 435 


Equation (10.23) can be used with an initial value (typically zero) of R, to 
compute the transient development of state covariance toward the steady state. 
Typically, we are only interested in the steady state, which is obtained by letting 
R,(k + 1) = R,(k) = R, (co). In this case, Eq. (10.23) reduces to the equation 
(called the discrete Lyapunov equation for its occurrence in Lyapunov’s stability 
studies) 


R (00) = ®,R,(co) ®7 + TRY. (10.24) 


Several numerical methods for the solution of Eq. (10.24) have been developed, 
some based on solving Eq. (10.23) until R, no longer changes, and others based 
on converting Eq. (10.24) into a set of linear equations in the coefficients of R, 
and solving these equations by numerical linear algebra as done by dlyap.m in 
MATLAB. 


€ Example 10.3 


Random Quantization Error for a First-Order System 
Determine the output of the first-order system of Eq. (10.5) forthe case where the multiplication, 
ax(k), is rounded with a quantum level q. 
Solution. The first-order system of Eq. (10.5), with the rounding error represented by e is 
x(k 4- 1) 2 ax(A) 4 e(k), 
and thus 
=a and r=! 


and from Eq. (10.18), we see that 


so that Eq. (10.24) is 
2 


q 
R,-aR I=); 
, 7 «R,a + (1) 12 * ) 
2 
q 1 
Ri 10.25 
s" i-o din 
Taking the square root, we find that 
q I 

o, JR, —2dzT—— 10.26 
5 v Ny adaa ae 


A plot of Eq. (10.26) along with the value of the worst-case bound from Eqs. (10.8) and 
(10.11) is shown in Fig. 10.4. 


* 
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Figure 10.4 
Error estimates and error 
bounds 
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Normalized quantization error 


In some cases, the output of a system is not one of the state elements. In 
other words, 
y=H,x+ Jw, 
and, therefore, 
E") = € [H x + J,w)(Hyx + Jw)"}, 
which is to say 
R, =H R, H; +J, RJ. (10.27) 
Note that we can use Eq. (10.23) and (10.24) to compute the covariance of the 
state due to round-off error at several locations simply by taking w to be a column 


matrix and R, to be a square diagonal matrix of covariances of the components 
of w. In the multiple-source case, I is a matrix of dimension n x p, where there 


are n states and p sources. 


€ Example 10.4 


Random Quantization Error for a Second-Order System 


Determine the quantization error of the second-order system 


l+a + 
y(k +2) +a ylk +1) +a,y(k) = Ah) +u(k —1)), 


in terms of the random root-mean-square response and compare it with the worst-case bounds. 


Figure 10.5 

Error estimates and error 
bound with quantization 
input to a second-order 
system, where r = 0.9 


10.2 


10.2 Effects of Parameter Round-Off 437 


12 — * "T". T = 
H(z)=K z*l 
212r cos 0z +r? i | | i 
10L H(l)=1 en ee a E 


| AS XXX = Worst-case bound 
000 = RMS estimate * 


+++ steady state | 


Normalized quantization error 


0 20 40 60 80 100 120 140 160 


Solution. The gain of the system has been set to be unity at frequency 0, and the parameters 
are related to the pole locations according to a, = —2r cos(0) and a, = r?. In order to study the 
role of the frequency of the pole, given in normalized terms by the angle 0, we have computed 
the estimated errors according to Eqs. (10.4), (10.16), and (10.27) for several values of 0 and 
plotted them in Fig 10.5. For the random-noise case, the square root of the result of Eq. (10.27) 
is plotted. From the plot, it is clear that the worst-case bound is quite large compared to the 
other two computations for most of the range of angles. Experiments in the laboratory with 
random inputs to the system give results that compare very favorably with the numbers given 
by the random model. 


+ 


Effects of Parameter Round-Off 


We have thus far analyzed the effects of round-off on the variables such as y and u 
in Eq. (10.5). However, to do the calculations for a controller, the computer must 
also store the equation coefficients, and if the machine uses fixed point arithmetic, 
the parameter values must also be truncated or rounded off to the accuracy of the 
machine. This means that although we might design the program to solve 


y(k + 1) = ay(k) + u(k), 
it will actually solve 
Ş(k + 1) = (a + ôæ)Ş(k) + u(k) + e(k). (10.28) 
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In this section, we give methods for the analysis of the effects of the parameter 
error da. 

The principal concern with parameter variations is that the dynamic response 
and especially the stability of the system will be altered when the parameters are 
altered. One way to study this problem is to look at the characteristic equation and 
ask what effect a parameter change has on the characteristic roots. For example, 
for the first-order system described by Eq. (10.28), the perturbed characteristic 
equation is 


z — (œ 4-60) = 0; 


and it is immediately obvious that if we want a pole at z — 0.995, it will be 
necessary to store œ to three decimal places and that the limit on óc for stability 
is 0.005 because a variation of this magnitude results in a pole on the unit circle. 
Note, however, that if we structure the œ term as œ = | — B and add the 1-term 
separately then B = 1 — a = 0.005, and the relative accuracy requirements on £ 
are much less than those on o. Thus we see that the details of the architecture of a 
realization can have a major impact on the robustness of the design to parameter 
value quantization. It is also the case that different canonical forms realized with 
a given parameter accuracy (word size) will be able to realize a different set of 
pole locations. It is often possible to select the realization structure in such a way 
that the desired dynamics are almost exactly realized with the available word 
length. 

To study these matters, we consider the characteristic equation and ask how 
a particular root changes when a particular parameter changes. The general study 
of root variation with parameter change is the root locus; however, we can obtain 
results of some value by a linearized sensitivity analysis. We can compare the 
direct and the cascade realizations of Chapter 4, for example, to see which is 
to be preferred from a sensitivity point of view. In the direct realization, the 
characteristic equation is 


z" To ae alo A =0. (10.29) 


This equation has roots at A,, A,,...,4,,, where A; = rel". We assume that one 
of the o''s, say œ,, is subject to error da, , and we wish to compute the effect this 
has on A j and especially the effect on r, So that stability can be checked. For 
this purpose, we can conveniently write Eq. (10.29) as a polynomial P(z, o) that 
depends on z and œ,. At z = js the polynomial is zero, so we have 


P(A, a) — 0. (10.30) 


If œ, is changed to a, + óo,,, then À; also changes, and the new polynomial is 


oP aP 
P(A, + 5A,, o, + da,) = P(A,, o) + wars 5A, + — 08, tes 
ð z=); da, 


=0, (10.31) 


root sensitivity 
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where the dots represent terms of higher order in 5A and da. By Eq. (10.30) we 
see that the first term on the right-hand side of Eq. (10.31) is zero. If A, and da, 
are both small, then the higher-order terms are also negligible. Thus the change 
in A, is given to first order by 


"P 8P/0a, 
d aP/dz 


Sat, (10.32) 


z=}; 


We can evaluate the partial derivatives in Eq. (10.32) from Eq. (10.29) and the 
equivalent form 


Riega p = (e— X) aA (2 — 4). (10.33) 
First, using Eq. (10.29), we compute 
ə P 
— = Los (10.34) 
9e, Lo 


and next, using Eq. (10.33), we compute’ 


aP 
=| = ] [o; ^^» (10.35) 
Z leek, — tyj 
Thus Eq. (10.32) reduces to 
n—k 
Aj 


6A, = — —— ——————- ôa. 
: Lg; (A, =À) : 

We can say things about root sensitivity by examining Eq. (10.36). The 
numerator term varies with the index number of the parameter whose variation 
we are considering. Because we are dealing with a stable system, the magnitude 
of A, is less than one, so the larger the power of Aj = the smaller the variation. We 
conclude that the most sensitive parameter is a, , the constant term in Eq. (10.29). 
However, for values of i, near the unit circle, the relative sensitivity decreases 
slowly as k gets smaller. The denominator of Eq. (10.36) is the product of vectors 
from the characteristic roots to à.. This means that if all the roots are in a cluster, 
then the sensitivity is high, and if possible, the roots should be kept far apart. For 
example, if we wish to construct a digital low-pass filter with a narrow-pass band 
and sharp cutoff, then the system will have many poles in a cluster near z = 1. 
If we implement such a filter in the control canonical form (Fig. 4.8c), then the 
sensitivity given by Eq. (10.36) will have many factors in the denominator, all 
small. However, if we implement the same filter in thc cascade or parallel forms, 
then the sensitivity factor will have only one term. Mantey (1968) studies these 
issues and quotes an example of a narrow-bandpass filter of six poles for which 


(10.36) 


7 This is true if P(z) has only one root at Àp At a multiple root, this derivative is zero and the coefficient 
of dc, in Eq. (10.36) is not bounded. 
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the parallel realization was less sensitive by a factor of 10 ?. In other words, it 
would take 17 bits of additional accuracy to implement this example in direct 
form over that required for the parallel or cascade form! The papers collected by 
Rabiner and Rader (1972) contain many other interesting results in this area. 


€ Example 10.5 


10.3 


Illustration of Parameter Storage Errors 


Compare the sensitivity of a fourth-order controller in cascade form 


(z +1% 
LOIR MEN Lo M 
(2 = Grong —085)z — 08) — 075) 


versus the same controller in direct form 
z? +327 +3z +1 


D = —— o O 
(2) = 96 3,300025 + 4.07752! — 2.23582 + 0.4590 


Solution. If the transfer function is realized as a cascade of the first-order terms, then it is 
clear that the pole nearest to the unit circle is at z = 0.9, and a change in the coefficient by 
0.1 will move this pole to the unit circle and lead to instability. This is a percent change of 
(0.1/0.9)100 = 11.1%. On the other hand, if the controller is realized in one of the direct 
forms—either control or observer canonical form—then the coefficients used are those in 
the polynomial form. In this case, by numerical experimentation it is found that a change 
of a, from 0.4590 to 0.4580, a root is moved to the unit circle. This is a change of only 
(0.001 /0.4590) 100 = 0.22%. Thus we see that the cascade form is more robust to parameter 
changes by a factor of almost 50 over the direct forms! 


* 


Limit Cycles and Dither 


As a final study of the effects of finite word length in the realization of digital 
filters and compensators we present a view of the quantizer as a signal-dependent 
gain and analyze more closely the motions permitted by this nonlinearity. One 
type of motion is an output that persists in spite of there being no input and that 
eventually becomes periodic: Such a motion is called a limit cycle. 

To analyze a nonlinear system we must develop new tools because superpo- 
sition and transforms do not apply. One such tool is to use graphic methods to 
solve the equations. For example, suppose we have the first-order equation 


y(k + 1) = Ofay(k)}. (10.37) 


If we plot y versus ay as in Fig. 10.6, we can also plot the function Q(a@y) and 
trace the trajectory of the solution beginning at the point a on the y-axis. Across 
from a at point b, we plot the value ay from the line with slope 1/o. Below b 
at c is the quantization of oy, and hence the next value of y, shown as d. We 


Figure 10.6 

Trajectory of a first-order 
system with truncation 
quantization 


Figure 10.7 

Trajectory of a first-order 
system with truncation 
quantization and 


negative initial condition 
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thus conclude that the trajectory can be found by starting with a point on the 
(1/a)-line, dropping to the Q{-} staircase, projecting left to the (1/o)-line again, 
dropping again, and so on, as shown by the dashed lines and arrowheads. Note 
that the path will always end on the segment of zero amplitude where oy < q. 
Now, however, suppose the initial value of y is negative. The trajectory is plotted 
in Fig. 10.7, where projection is up to Q and to the right to the (1/a)-line. Note 
that this time the trajectory gets stuck at the point s, and the response does not 
go to zero. The point s is an equilibrium or stationary point of the equation. If 
the initial value of y had a magnitude smaller than s, then the motion would have 
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moved down to the next equilibrium where the (1/a)-line and Q{-} intersect. It 
is clear that s represents the largest value of y that is at equilibrium and that this 
value depends on œ. We should be able to find a relation between kq, the largest 
value of y at equilibrium, and o, the time constant of the filter. In fact, from 
inspection of Fig. 10.7 we see that the largest value of y at a stationary point is a 
value y — —kq such that 


—kqa < —kq+q, Or ka »5k—l, Or 


l l 
ilu. or lor setas (10.38) 

The last result is the same as the worst-case bound that would be found from 
Eq. (10.8) with the maximum value of error q for truncation rather than q /2 as is 
the case for round-off. Thus we find in the first-order system that the worst case 
is rather likely to be realized. 

In order to extrapolate these results to a higher-order system we must find a 
more subtle way to look at them. Such a viewpoint is provided by the observation 
that at the equilibrium points of Fig. 10.7 the gain of the quantizer is exactly 
1/a. If we consider the quantizer as a variable gain, then the equilibrium occurs 
at those points where the combined gain of quantizer and parameter o are unity, 
which, for a linear system, would correspond to a pole at z = 1. The extrapolation 
of this idea to higher-order systems is to examine the range of possible equivalent 
gains of the quantizers and to conjecture that the limiting motion will be no larger 
than the largest signal for which the linear system with the resulting equivalent 
gain(s) has a pole on the unit circle. 

We will illustrate the conjecture by means of the second-order control canon- 
ical form shown in Fig. 10.8(a) with the quantizer characteristic shown in Fig. 
10.8(b) corresponding to round-off rather than truncation. Note from Fig. 10.8(b) 
that the staircase of this quantizer is centered about the line of slope 1.0 passing 
through the origin. The characteristic equation for this system, if quantization is 
ignored, is 


z+az+a,=0, 
z? — 2r cos 0z +r? 2 0, 


where we assume that 0 < a, < 1 anda, corresponds to a real angle. Thus we see 
that the system will have complex roots on the unit circle only if the action of the 
quantizer is to make the effective value of a, = r? = 1.0. Thus the condition for 
an oscillation is that Q(a;y) > y. Following an analysis similar to Eq. (10.38), the 
property of the quantizer is that Q(x — q/2) x q/2. If we leta,y = kq — q/2, 
then the oscillation condition becomes 


kq —q/2 
ig cR 


a, 


Figure 10.8 
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A second-order system in control canonical form and the quantizer characteristic 


corresponding to round-off 


dither 


Q(x) 


(b) 


from which the amplitude is predicted to be less than 


l q 

kq < TESTE i 
The effect of quantization on the a,-term influences only the frequency of the 
oscillations in this model. For this purpose, because the equivalent radius is 1, 
the equation for the digital frequency is 9 = cos™'(- a,/2). 

Another benefit of the view of quantization as a variable gain is the idea that 
a second signal of high frequency and constant amplitude added to the input of 
the quantizer can destroy the limit cycle. Such a signal is called a dither, and its 
purpose is to make the effective gain of the quantizer 1.0 rather than something 
greater than 1. Consider again the situation sketched in Fig. 10.7 with the signal 
stuck at s. If an outside high-frequency signal of amplitude 3q is added to y, then 
one can expect that although the output would contain a fluctuating component, 
the average value would drift toward zero rather than remain stuck at s. If the 


frequency of the dither is outside the desired pass band of the device, then the 
result is improved response; thatis, alarge constant bias or elsea high-amplitude, 

low-frequency, self-sustained, limit-cycle oscillation can sometimes be removed 
this way at the cost of a low-amplitude, high-frequency noise that causes very 
low amplitude errors at the system output. 


(10.39) 


€ Example 10.6 


Quantization- Caused Limit Cycles and Effect of Dither 


For the system in Fig. 10.8 with a, — —1.78 and a, — 0.9, 
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Figure 10.9 

(a) Second-order 
response with and 
without quantization 
showing limit cycle. 
(b) Second-order 
response with 
quantization q = 0.1 
and dither = 4q 


(a) determine the response to the initial condition of x, = 2 and x, — 0 with q = 0 and 
g = 0.1. Compare the results to that predicted by Eq. (10.39). 


(b) Determine a dither signal that improves the response. 


Solution. A simulation of the system of Fig. 10.8 with and without quantization is shown 
in Fig. 10.9(a). The limit cycle is clearly visible for the g = 0.1 case. The system parameters 


correspond to roots at a radius of 0.95 at an angle of 20°. The quantization level was set at 0.1. 
The response with quantization clearly shows the limit cycle with an amplitude of 0.5, which is 
exactly the value given by Eq. (10.39) with a, = 0.9. The period of the limit cycle can be seen 
from the figure to be approximately 14 samples. If we compute the frequency corresponding 
to the value of a, = —1.78 with the radius taken as ] in order to reflect the fact that the system 


is oscillating, we find 0 = cos”! (-a,/2) = 27°. This angle corresponds to an oscillation with a 


period of 13.2 samples, an estimate that is quite good, considering that the true period must 
be an integral number of sample periods. 


In Fig. 10.9(b) are plotted the response with q = O and the response with q = 0.1 with 


dither added. In this case, after some experimentation, it was found that a square-wave dither 
at the Nyquist frequency worked quite well at an amplitude of 4g. In that case the steady-state 
response has been reduced in amplitude from 0.5 to 0.1 and the frequency has been increased 
from that having a period of 14 to that having a period of 2, namely the Nyquist frequency. 
The experiments found that dither of less amplitude did not remove the natural limit cycle 
and dither of higher amplitude gave a larger steady-state error response. A random dither was 


20 40 60 80 100 120 140 
60 = 20°; 4, = 1,783; az =0,9;9=0.1 


Dithered case, q = 0.1 
No dither, q = 0 


Time 
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also tried, which was found to be much less effective than the Nyquist frequency square wave. 
Unfortunately, the selection of the amplitude and signal shape of an effective dither remains 
more a matter for experimentation than theory. 


* 


10.4 


Summary 


* Multiplication round-off errors are bounded, in steady-state, according to 
Ij, (69) < 12,017. (10.10) 


where H, is the transfer function between the quantization error and the 
output; and |H, (1)] is the dc gain of HJ. 

* Multiplication round-off errors can be analyzed under most conditions by 
considering them to be an additional random input at each multiplication. 
The distribution of the random input is flat, its mean is 0, its variance is 
R = q’/12, and it can be considered to be white (no time correlation). 


The effect of the random multiplication round-off error is analyzed by using 
the discrete Lyapunov equation 


R (00) = $ R (œ) + TRY, (10.24) 


which is evaluated via MATLAB using dlyap.m or qrms.m. For 16 and 32 
bit computers, these errors are usually negligible. They could be significant 
using a computer with 8 bits or less, as will be shown in Fig. 11.6. 

* The effect of parameter round-off (or parameter storage error) is systematic 
and has the capability to render an otherwise stable system unstable. It was 
shown that a parallel or cascade implementation of the difference equations 
significantly reduces the sensitivity to this type of error as does the use of a 
large number of bits to represent the parameters being stored. 

* Under certain conditions (usually lightly damped systems), uncommanded 
oscillations will occur due to quantization called limit cycles which can be 
alleviated by the addition of low amplitude oscillations called dither. 


10.5 Problems 


10.1 For the system from Eq. (10.5) with a = 0.9 and q =.01, what is the rms of the 


quantization error of y that you expect for random input, u(k)? 


10.2 Verify Fig. 10.4 by simulating the response of the system from Eq. (10.5) to u (k) = 1 with 


and without quantization of the wy multiplication. Usea = 0.9, 9 = .02, and compute the 
response from k = 0 to k = 100, repeating the simulation several times. Compute © 
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10.3 


10.4 


10.5 


of the difference between the case with and without quantization atk = 5 andk = 95and 
compare with Fig. 10.4. Commenton the match you found and discuss any discrepancies. 


A digital lowpass filter is described by the equation 
y(k) = Q,loy(k — 1)] + Q,IC — o)u(k — 1)]. 


Assume that œ > 0 and that the input, u(k), is a slowly varying signal that, for the 
purpose of this analysis, can be approximated by a constant. The magnitude of the input 
is restricted to be less than 0.99. 


(a) If both quantizers operate with £ bits to the right of the point, what is the value of 
the quantum g for this case? 


(b) Give an expression in terms of o for the minimum value that € must have to 
guarantee that quantization error cannot cause y to exceed 1.0. 


(c) Evaluate your expression in part (b) and give the necessary bit countifo — 0.9, 


0.98, 0.995. 
(d) Suppose the input A/D quantizer Q, is fixed at 12 bits with 11 bits to the right of 


the fixed point. At what value of o can the quantization error alone cause the input 
to Q, to equal unity? 


A digital filter with the structure of Fig. 10.8 is preceded by an A/D converter having 12 
bits of accuracy, with 11 bits to the right of the fixed point. The quantizer Q, is a 16-bit 
word length scaled to have 3 bits to the left of the point. The quantizer Q, is scaled to 
have only 1 (sign) bit to the left of the point in a 16-bit word length. Let a, = —1.6 and 
a, — 0.81. 


(a) Give the quantum sizes q, for the converter and each internal round-off quantizer. 


(b) Give the relevant transfer functions necessary to compute the output quantization 
error. 


(c) Compute the steady-state worst error at the output due to quantization. 

(d) Use MATLAB to compute the worst-case bound on output quantization error. 

(e) Use MATLAB to compute the rms error due to quantization using the white-noise 
model. 

For the second-order observer canonical form shown in Fig. 10.10, 


(a) compute the transfer functions from the quantizers to the output, y. Note carefully 
how many need to be computed. 


(b) If b, =b,=1, a, =—1.6, a, = 0.81, what is the maximum steady-state error 
due to equal rounding quanta of +g /2? 


(c) Show that the stochastic state-transition model to use Eq. (10.24) on this system 
has 


16 1 0 g E 
es | d ar ial ae R (00) = es H= [i+ 0]. 


ee aed 


solve Eq. (10.24) for this case, and compute R, (0) from Eq. (10.27). 


(d) Assume that 


Figure 10.10 
A second-order system 
with quantization 


10.6 


10.7 


10.8 


10.5 Problems 447 


Solve for the rms output due to round-off for the system of Fig. 10.8 using computer 
tools if a, = —1.6 anda, = 0.81. 

The filter shown in Fig. 10.10 has input from an A/D converter that has 10 bits, 9 bits to 
the right of the fixed point. The input magnitude is restricted to 1. The four quantizers 
are based on 16-bit words scaled to have, respectively, 12, 12, 9, and 13 bits to the right 
of the points (i.e., Q, has 9 bits to the right). The parameters are a, = —1.6, a, = 0.81, 
and b, — b, — 1. 

(a) Give the quanta q, for each quantization location. 

(b) Give the respective transfer functions from each quantizer to the output. 


(c) Use MATLAB to compute the worst-case error bound on the output quantization 
error. 


(d) Use MATLAB to compute the rms output error for this system based on the 
white-noise model of quantization error. 
Consider the discrete compensation whose transfer function is 


z? 


(z — 0.5)(z — 0.55) (z — 0.6)(z — 0.65)’ 

(a) If implemented as a cascade of first-order filters with coefficients stored to finite 
accuracy, what is the smallest coefficient perturbation that could cause instability? 
Which parameter would be most sensitive? 

(b) Draw an implementation of D(z) in control canonical form with parameters 
@,, Q5, â}, and a,. Compute the first-order sensitivities of these parameters 
according to Eq. (10.36). Which parameter is most sensitive? Compare the 


D(z) = 


448 Chapter 10 Quantization Effects 


10.9 


sensitivities of root locations for the cascade and the control forms for these 
nominal root positions. 
(c) Using the root locus, find the maximum deviations possible for the parameters of 
the control canonical form and compare with the results of part (b). 
For the equivalent-gain hypothesis of limit-cycle behavior, 
(a) Show that the equivalent gain of the round-off quantizer is given by 
G < 2k E= L2 
MA = Za 
for a signal of amplitude greater than or equal to kq — q/2. 
(b) What is the largest-amplitude limit cycle you would expect for a second-order 
control canonical form system with complex roots at a radius of 0.9 and a 
quantization level of 0.01? 


10.10 For the system of Fig. 10.8, 


10.11 


(a) Use MATLAB to simulate the response to the initial conditions x, = 2 and x, = 0 
with zero input. Use a, = —,/2a, and a, = 0.8, 0.9, 0.95, and 0.98. Compare the 
amplitudes and frequencies of the limit cycles (if any) with the values predicted. 


(b) Add dither at the Nyquist frequency to the quantizer of a, y with amplitude A and 
find the dither amplitude that minimizes the peak output in the steady state. 


Repeat Example 10.6, but assume the design full scale is +20 units and the system is 
being implemented with an 8 bit cpu. Find the amplitude and frequency of the limit 
cycle, compare with the theory, and find a dither that improves the situation as much as 
possible. (Hint: Figl 009.M available from www.elliskagle.com will be useful.] 


10.12 For the system of Example 10.6, assuming the design full scale is +20 units, find the 


number of bits required in the cpu to eliminate the limit cycle without any dither. (Hint: 
Fig]009.M available from www.elliskagle.com will be useful.] 


10.13 For a system structured as in Example 10.6 with the same initial conditions, but with 


10.14 


the coefficients selected so that the equivalent damping is 4 = 0.05, assume the design 


full scale is +20 units and find the number of bits required in the cpu to eliminate the 
limit cycle without any dither. (Hinr: Fig]009.m available from www. elliskagle.com 
will be useful.] 


For the discrete system 


3 2 
T3 3 1 
De) = 2 Ze te Sei 


z* — 3.30002? + 4.07752? — 2.2358z + 0.4590 ` 


how many bits are required for parameter storage in order to keep errors in the pole 
locations to be less than 0.01 in the z-plane in either the real or complex directions? 
Assume the system is to be implemented in the direct form. 


ell- 


Sample Rate Selection 


A Perspective on Sample Rate Selection 


The selection of the best sample rate (or rates) for a digital control system is a 
compromise. Generally, the performance of a digital controller improves with 
increasing sample rate, but cost may also increase with faster sampling. A de- 
crease in sample rate means more time is available for the control calculations; 
hence slower computers are possible for a given control function, or more control 
capability is available for a given computer. Either result lowers the cost per func- 
tion. For systems with A/D converters, slower sampling means less conversion 
speed is required, which will also lower cost. Furthermore, we will see that faster 
sampling can sometimes require a larger word size, which would also increase 
cost. All these arguments suggest that the best choice when considering the unit 
product cost is the slowest sample rate that meets all performance specifications. 

On the other hand, digital controllers are often designed and built for systems 
where a very small number will be built. In this case, the cost of the design effort 
can be more than the unit product costs, and the savings in design time that are 
realized with very fast (> 40x bandwidth) sampling dictates that a higher rate is 
the best choice. 

This chapter will discuss the influence of the sample rate on system perfor- 
mance in order to give some insight into how to reconcile the issues for a given 
design. 


Chapter Overview 


Section 11.1 examines the fundamental limit on the sample rate imposed by the 
sampling theorem; Section 11.2 covers the effect on the time response, smooth- 
ness, and time lags; Section 11.3 examines the regulation effectiveness as mea- 
sured by the response errors from random disturbances; Section 11.4 looks at the 
effect of sample rate on the sensitivity to plant parameter variations; and Section 
11.5 examines how sample rate affects the error due to the measurement noise 
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11.1 


sample rate lower bound 


Figure 11.1 


and the influence of analog prefilters or antialiasing filters on this error. Since 
there are often conflicts in selecting the sample rate for different functions in a 
controller, some designers elect to have more than one sample rate (a “multirate 
system") and this topic is discussed in Section 11.6. 


The Sampling Theorem’s Limit 


An absolute lower bound for the sample rate would be set if there is a specification 
to track certain command or reference input signals. This bound has a theoretical 
basis from the sampling theorem discussed in Section 5.2. Assuming we can 
represent our digital control system by a single-loop structure as depicted in 
Fig. 11.1, we can specify the tracking performance in terms of the frequency 
response from r to y. The sampling theorem states that in order to reconstruct an 
unknown band-limited continuous signal from samples of that signal, one must 
use a sample rate at least twice as fast as the highest frequency contained in 
the unknown signal. This theorem applies to a feedback controller like the one 
illustrated in Fig. 11.1, because r is an unknown signal that must be followed by 
the plant output y. If we want the system to track r up to a certain closed-loop 
bandwidth, w,, it follows that r will have spectral content up to that frequency. 
Based on the sampling theorem therefore, the sample rate must be at least twice 
the required closed-loop bandwidth of the system, that is 


A o” (11.1) 
w, 

If a designer were to specify a certain value of the closed-loop bandwidth 
and then pick a sample rate that violated Eq. (11.1), the result would be that 
the sampled values of r would be aliased as discussed in Section 5.2, and the 
result would be a system response that was unstable or considerably slower than 
specified. 

A similar argument can be made with regard to the closed-loop roots of 
a system, which typically would be slightly slower than the bandwidth. If a 
designer specified a closed-loop root by its s-plane location and failed to sample 
at twice its frequency, the actual root realized will be aliased and can have little 


Digital control system schematic 


11.2 
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resemblance to that specified. In practice, no designer would consider such a low 
sample rate; we bring it up only because it marks the theoretical lower limit of 
possibilities. 


Time Response and Smoothness 


Equation (11.1) provides the fundamental lower bound on the sample rate. In 
practice, however, this theoretical lower bound would be judged far too slow 
for an acceptable time response. For a system with a rise time on the order of 
l sec (which translates to a closed-loop bandwidth on the order of 0.5 Hz), it 
would be typical to choose a sample rate of 10 to 20 Hz in order to provide some 
smoothness in the response and to limit the magnitude of the control steps. This 
means that the desired sampling multiple (= w,/w,) for a reasonably smooth 
time response is 


w 
20 < — < 40. (11.2) 


o, 


Double Integrator Control Smoothness vs. Sample Rate 


Compute the unit step response of the double integrator control problem as developed in 
Example 8.1 and plot the output x,, its rate x,, and control u time histories. Find the feedback 
gains for sample rates of 4, 8, 20, and 40 times the bandwidth, w,, so that the responses all 
have closed-loop roots in the z-plane that are equivalent to s = 0.5 c, (1 + j). Discuss your 
results. 


Solution. The four responses are shown in Fig. 11.2. The gains were computed using pole 
placement where the z-plane poles were computed using z = e". Note that the relation 
s — 0.5 e, (1 + j) is approximate in that the actual bandwidth of the closed-loop frequency 
response may be slightly different. 

It is interesting to note that the x, response was smooth for all cases, including the one 
with w/w, = 4; however, the acceleration had large discontinuities and would have had a 
strong tendency to excite any flexible modes and produce high stresses in the actuator and 
its surrounding structure. In turn, these acceleration steps produced noticeable changes of 
slope in the velocity. A sampling multiple of w,/w, 7 20 appears necessary for a reasonable 
smoothness. 


* 


The degree of smoothness required in a particular design depends on the 
application and is highly subjective. The commands issued to an electric motor 
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Figure 11.2 

Double integrator step 
response for the 
sampling multiple with 
«a [ay equal to (a) 4, 


(b) 8, (c) 20, and (d) 40 


time delay 


Outputs 
Outputs 


Outputs 


can have large discontinuities, whereas the commands issued to hydraulic actu- 
ators are best kept fairly smooth.' The tolerance to roughness in the response 
also depends on the application; for example, if a person is being affected by the 
controller, a smooth ride is likely desirable. An unmanned satellite controller can 
be rough; however, slow sampling contributes to increased pointing errors from 
disturbances, a topic that is discussed in the following section. 

In addition to the smoothness issue, it is sometimes important to reduce the 
delay between a command input and the system response to the command input. A 
command input can occur at any time throughout a sample period; therefore, there 
can be a delay of up to a full sample period before the digital controller is aware 
of a change in the command input. All the responses in Fig. 11.2 assumed that the 
controller was aware of the command input at time zero and that, therefore, all the 
attitude responses, x,, responded in a similar fashion. For systems with human 
input commands where the system response is critical (such as fly-by-wire flight 
control), the time delay alone suggests that the sample period be kept to a small 
fraction of the rise time. A pilot flying an airplane with digital fly-by-wire flight 
control will complain if the sampling delay is on the order of a tenth of a second 
from input action to the beginning of the response. Assuming we wish to keep 


| Sometimes lowpass filters are placed between the ZOH output and the actuators to soften the 
discontinuities, but the filters must be taken into account during design and compensated for. 
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the time delay to be 10% of the rise time, a 10-Hz sample frequency should be 
used for 1 sec rise time or, in terms of the nondimensional sampling multiple 


25 > 20. (11.3) 


w, 


Double Integrator Response vs. Sample Rate 


Repeat Example 11.1, but add a one cycle delay between the input command and the start of 
the control input in order to assess the worst case phasing of the input. Discuss the impact of 
this issue. 


Solution. The result is shown in Fig. 11.3. It demonstrates the noticeable impact on the 
response for the two slower cases. The overshoot is unchanged because the controller was 
adjusted in each case to maintain the same z-plane roots, but the extra delay affected the rise 
time substantially as measured from the instant of the input command. 


Figure 11.3 

Double integrator step 
response with worst case 
phasing between 
command input and the 
sampler with w,/w, 
equal to (a) 4, (b) 8, 

(c) 20, (d) 40 
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Figure 11.4 

Block diagrams of the 
systems for disturbance 
analysis 


Errors Due to Random Plant Disturbances 


Disturbance rejection is an important aspect of any control system and, in many 
cases, is the most important one. In fact, Berman and Gran (1974) suggest that 
the sample rate for aircraft autopilots should be selected primarily on the basis 
of its effect on disturbance rejection. Indeed, this is the case for many control 
applications, although there are multitudes of applications with widely varying 
conditions where other factors are more important. 

Disturbances enter a system with various characteristics ranging from steps 
to white noise. For determining the sample rate, the higher frequency random 
disturbances are the most influential; therefore, we will concentrate on their 
effect. In other words, we will look at disturbances that are fast compared to the 
plant and the sample rate, that is, where the plant noise can be considered to be 
white. 

The ability of the control system to reject disturbances with a good continuous 
controller represents a lower bound on the magnitude of the error response that can 
be hoped for when implementing the controller digitally. In fact, some degradation 
over the continuous design must occur because the sampled values are slightly 
out of date at all times except at the very moment of sampling. In order to analyze 
the degradation of the digital controller as compared to the continuous controller, 
it is important to consider the effect of the noise [w in Eq. (4.45)] consistently, 
with both the continuous and the digital controllers. 

The block diagram in Fig. 11.4 shows the situation. The plant noise, generally 
a vector quantity, w, is continuous in nature and acts on the continuous part of 
the system independently of whether the controller is continuous or discrete. 
Furthermore, we are at liberty to analyze the effect of w with a certain power 
spectral density regardless of what values of R,, or R, were used in computing the 
estimator gains, L, and regardless of whether an estimator was used. Proceeding 
then, we have a continuous system represented by 


X = Fx + Gu + G,w, (11.4) 


Continuous 
controller 


Discrete 
controller 
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where the power spectral density of w is R,,., (alternatively referred to as the 
“white-noise intensity” or “mean-square spectral density") so that the covariance 
of w is 


E[w(t)w7(t + 1)] = R, 49 (T). 
The steady-state value of the covariance of x is given by the Lyapunov equation? 


FX + XF” + G R paG = 0. (11.5) 
The solution to this equation, X, (= E[x(z)x(t)"]) represents the amplitude of 
the random response of the state due to the excitation from w. It will be used to 
establish a baseline against which discrete controllers are compared. Note that 
the system matrices, F and G, can represent a closed-loop system including a 
continuous controller. The solution is obtained by lyap.m in MATLAB. 

It is also necessary to evaluate X when the system has a digital controller 
for the identical excitation applied to the plant. In order to do this, the entire 
system must be transferred to its discrete equivalent as given by Eq. (9.81) 
and discussed in Section 4.3. The discrete equivalent of Eq. (11.5) is given by 
Eqs. (9.90), (10.24), and (D.34) in slightly different contexts. The desired result 
for our purposes here, called the discrete Lyapunov equation (see dlyap.m in 
MATLAB) is 


PXP” +C, =X, (11.6) 


where 
T 
C,= Í P(t)G R GT P (r)dr. (11.7) 
0 


As discussed in Section 9.4.4, this integral can be approximated if T is shorter 
than all system time constants by 


C ZTR I7, whee R,= E (11.8) 
When the approximation is not valid, it is necessary to evaluate Eq. (11.7) exactly 
using Van Loan's (1978) algorithm, which can easily be done using disrw.m in the 
Digital Control Toolbox. Therefore, in order to evaluate the effect of sample rate 
on the performance of a controller in the presence of white plant disturbances, we 
first evaluate Eq. (11.5) to find the baseline covariance (X) and then repeatedly 
evaluate Eq. (11.6) with varying sample rates to establish the degradation versus 
sampling. It usually suffices to examine the diagonal elements for a performance 
measure and to compute their square roots to find the rms value, the quantity that 
is typically measured. 


2 See Kwakernaak and Sivan (1972). 
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€ Example 11.3 Double Integrator Disturbance Response vs. Sample Rate 


Examine the effect of sample rate on the performance of a digital control system compared to 
a continuous control system for the double integrator plant used in Examples 11.1 and 11.2. 


(a) 


(b) 


Assume the plant is driven by white noise entering in the same way as the control input 
with Rye <4 = | and assume the use of full-state feedback. 
Assume the same plant noise, but now use an estimator with only x, measured with 


additive noise with R, = 1. Repeat this case assuming there is quantization in the estimator 
equivalent to 7, 8, and 9 bit word size. 


Solution. 


(a) 


(b) 


The open loop F, G, and H are given by Eq. (4.47). The control gain, K, was determined 
by selecting optimal weighting matrices so that the closed-loop system had roots at 
s=05,(1£/)(¢ = 0.7) and we assume that w, = V2w,. With full-state feedback the 
closed-loop continuous system matrix is given by F, = F — GK. The plant disturbance 
noise enters the plant in precisely the same way that the control does, that is, G, — 
G. Therefore the response of a hypothetical continuous controller is found by solving 
Eq. (11.5) with F replaced with F, and G, as stated. Because we wish to illustrate only 
the degradation of the discrete controller compared to the continuous one, the choice of 
R, ose = 1 has no effect on the results. 

The family of discrete controllers with different sample periods were all designed 
to the same continuous cost function according to the method of Section 9.3.5, the idea 
being that all the discrete controllers should be trying to do the same thing. Each discrete 
design resulted in a unique K and system matrix ®, (=  — PK), which was used in 
Eq. (11.6) to evaluate X. Because this example had no dynamic characteristics that were 
faster than the slowest sample period, the approximation for the plant noise given by 
Eq. (11.8) could have been used, although the exact calculation of Eq. (11.7) was actually 
used. The result shown in Fig. 11.5 is the ratio of the rms values for the discrete case to 
the continuous case. The specific curve shown is for the rms of the x, variable; however, 
both rras ratios are essentially identical. 

If white plant disturbances were the dominant source of error in the system, one 
could conclude from this example that a sampling multiple of 


2s =~ 29 (11.9) 


w, 


would be a good choice. The relative errors grow quickly when sampling slower than this 
multiple, whereas the gain by sampling faster is simply to reduce the degradation from 
about 20% downward. 


We can analyze the disturbance response of the system when the controller includes an 
estimator in a similar manner to that above. In this case, the system matrix ® in Eq. (11.6) 
is defined by Eq. (8.62). Note that ® now includes the plant dynamics as well as the 
estimator dynamics. The continuous control roots are in the same location as for part (a) 
and the continuous estimator roots were selected twice as fast as the control roots. The 
family of discrete controllers were found by using pole placement so that the discrete 
control and estimator roots were related to the continuous roots by 2 — e°". The lowest 


Figure 11.5 

Discrete controller 
degradation versus 
sample rate for full state 
feedback and driven by a 
white disturbance, 
Example 11.3 


Figure 11.6 

Discrete controller 
degradation versus 
sample rate for the case 
with an estimator with 
quantization errors and a 
white disturbance, 
Example 11.3. 
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curve in Fig. 11.6 shows the discrete to continuous rms ratio and, compared to Fig. 11.5, 
is slightly higher. This is to be expected because the velocity information is now being 
estimated from a position measurement and the greater sensitivity to disturbances is the 
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result of the approximate differentiation that is occurring. Although the curve is generally 
higher at all sample rates, the conclusion concerning sample rate selection for the case 
where disturbances are the dominant consideration is the same; that is, sample at 20 times 
the bandwidth or higher. 


The addition of the random noise from quantization shows that there are limits to the 
improving noise response as the sampling rate increases. Figure 11.6also includes quantization 
noise added according to the discussion in Section 10.1. Quantization is usually important only 
with a fixed-point implementation of the control equations, and an assumption was therefore 
required as to the scaling of the signals for this analysis. In Fig. 11.6 it is assumed that the 
controller was scaled so that the continuous rms error due to the plant disturbance is 296 of 
full scale, a somewhat arbitrary assumption. But the point is not the specific magnitude as 
much as the notion that there is a limit, and that if a designer is dealing with a microprocessor 
with fewer than 12 bits, it can be useful to perform a similar analysis to determine whether 
the word-size errors are sufficiently large to impact selection of sample rate. With 16- and 
32-bit microprocessors and a parallel or cascade realization, the increase in rms errors due to 
quantization at the fast sample rates is typically so small that word size is not an issue and no 
practical upper limit to the sample rate exists. However, if using an 8-bit microprocessor, it 
may be counterproductive to use too high a sample rate. 


* 


The example, although on a very simple plant, shows the basic trend that 
usually results when a white disturbance acts on the plant: The degradation due 
to the discrete nature of the control over that possible with a continuous control 
is significant when sampling slower than 10 times the bandwidth. Except for 
controllers with small word sizes (8 bits or less), the performance continues to 
improve as the sample rate is increased, although diminishing returns tend to 
occur for sampling faster than 40 times the bandwidth. 

Whether the rms errors due to plant disturbances are the primary criterion 
for selecting the sample rate is another matter. If cost was of primary importance 
and the errors in the system from all sources were acceptable with a sample 
rate at three times bandwidth, nothing encountered so far in this chapter would 
necessarily prevent the selection of such a slow sample rate. 

On the other hand, resonances in the plant that are faster than the bandwidth 
sometimes can have a major impact on sample rate selection. Although they do 
not change the fundamental limit discussed in Section 11.1, they can introduce 
unacceptable sensitivities to plant disturbances. The analysis is identical to that 
used in the example above, except that it is mandatory in this case that an 
accurate evaluation of the integral in Eq. (11.7) be used because there are some 
plant dynamics that are faster than the sample period. 
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Figure 11.7 

Discrete controller 
degradation for the 
double mass-spring 
system using full state 
feedback, Example 11.4 


Double Mass-Spring Disturbance Response vs. Sample Rate 


Repeat Example 11.3 for the double mass-spring system that was used in Examples 8.3 and 
9.5 and described in Appendix A.4. Do part (a) for full-state feedback and part (b) for an 
estimator based on a measurement of d, that is, the noncolocated case where the resonance is 
between the sensor and the actuator. 


Solution. The parameter values used are as given in Example 8.3, except that we will 
consider the case with no natural damping as well as the lightly damped (Z — 0.02) case 
used previously. The optimal continuous design was carried out in order to achieve a 6:1 
ratio between the resonant mode frequency, w,, and the system bandwidth, w,. Weighting 
factors were applied to d and y, which provided good damping of the rigid body mode and 
increased slightly the damping of the resonance mode. The family of discrete designs were 
found by using pole placement so that the discrete roots were related to the continuous roots 
by z = ef". The result for the full state feedback case is shown in Fig. 11.7. The general 
shape of the curves is amazingly similar to the previous example, but there are three highly 
sensitive sample rates at 2 w,,w,, and w,/2. Note that these are no! resonances in the traditional 
sense. The curves represent the ratio of the response of the discrete system compared to the 
continuous, so that large spikes mean that the discrete controller at that sample rate exhibits 
poor disturbance rejection compared to the continuous controller. The excitation is broad band 
for both controllers at all sample rates. Also note that there is no peak at 2 w, for the case with 
£ = 0.02. 

The origin of the sensitivity peak at exactly w, is that the controller has no information 
about the resonance at these critical sample rates. The sampled resonance value will be constant 
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v0- zeta=0 


= zeta- 0.02 


Discrete/continuous RMS 


10° 10! 10? 
Sampling multiple, o /Q, 


460 Chapter 11 


Sample Rate Selection 


no matter what the phase relationship between sampler and the resonant mode, thus producing 
an unobservable system. The additional peak at 2w, arises from a similar unobservability, 
except that in this case the unobservability occurs only when the phasing is such that the 
samples are at the zero crossings. This peak is significantly less likely to occur and vanishes 
with a small amount of natural plant damping. 

Figure 11.8 shows the sensitivity of the discrete controller to disturbance noise for the 
more realistic case where the full state feedback has been replaced with an estimator using the 
measurement of d (essentially a notch filter). As might be expected, the sensitivity peaks have 
become more serious. The figure shows a wider sample rate band of high sensitivity about 
each of the previous trouble points and additional peaks at 2w, and 2o. So we now see that 
there are sensitive sample rates at all integer fractions of 2w, due to both unobservabilities 
noted above. 


Figure 11.8 

Discrete controller 
degradation for the 
double mass-spring 
system using an 
estimator with the 
measurement from d, 
Example 11.4 


+ 


Example 11.4 demonstrates that, in some cases, a digital controller with 
sample rates centered about 2c, or integer fractions of that value has difficulty 
and amplifies the effect of random disturbances acting on the plant. Factors that 
influence whether these sensitive sample rates exist have been studied by Hirata 
(1989) and generally show that the amount of damping added by the controller 
is the key aspect. Had more damping been added to the resonant modes by the 
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3 The phenomenon was noted and discussed by Katz (1974) and analyzed by Hirata (1989). 
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controller than shown by the example, the sensitivity peaks would have been more 
pronounced because use of the mode information by the controllers would have 
been more critical to the performance. Furthermore, use of a colocated sensor 
and actuator—that is, using the measurement of y instead of d—produces results 
that are very little different than those shown for the noncolocated case, provided 
that the controller is adding some damping to the resonant mode. 

The impact of this is that, for systems where the controller is adding some 
damping to a lightly damped mode, the only safe place to select a sample rate is 
faster than twice the resonant frequency, that is 


w, > 20, (11.10) 


There is a possibility to pick the sample rate in one of the "valleys" in Fig. 11.8; 
however, resonant frequencies can often vary considerably, thus rendering this 
approach unreliable, and would cause this system to lack "robustness." For sys- 
tems with high-frequency resonant modes with adequate natural damping and to 
which the controller adds no damping, there are typically no sensitive sample 
rates related to the resonant mode, and these considerations can be ignored when 
selecting the sample rate. Note that these resonant modes may be considerably 
faster than the system bandwidth and, therefore, sampling faster than 20 times 
bandwidth may be advisable in order to eliminate the possibility of these sensitive 
sample rates. 

In summary, the sample rate has a major impact on how well a digital con- 
troller performs in the presence of plant disturbances. Performance approaching 
that of a continuous controller can be achieved providing the sample rate is 20 
or more times faster than the bandwidth for systems with no significant dynam- 
ics faster than the bandwidth. For systems where the controller is adding some 
damping to a lightly damped mode that is faster than the bandwidth, the sample 
rate should also be at least twice that resonant frequency. 


Sensitivity to Parameter Variations 


Any control design relies to some extent on a knowledge of the parameters 
representing plant dynamics. Discrete systems generally exhibit an increasing 
sensitivity to parameter errors for a decreasing «,. The determination of the 
degree of sensitivity as a function of sample rate can be carried out by accounting 
for the fact that the model of the plant in the estimator (or compensation) is 
different from the actual plant. 

In Section 8.3.1, we assumed that the actual plant and the model of the plant 
used in the estimator were precisely the same. This resulted in the separation 
principle, by which we found that the control and estimation roots designed in- 
dependently remained unchanged when the feedback was based on the estimated 
state. We will now proceed to revise that analysis to allow for the case where the 
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parameters representing the plant model in the estimator are different from those 
of the actual plant. 
Let us suppose the plant is described as 


x(k 4-1) = 9 x(k) + r u(k), 
y(k) = Hx(k), (11:11) 
and the current estimator and controller as 
K(k) = x(k) + L(y(k) — Hx(k)), 
x(k +1) = o x(k) + Tuck), 
u(k) = —Kx(k). (11.12) 


In the ideal case, «p. = ®, (= ®) and EL, =T, (=P), and the system closed- 
loop roots are given by the controller and the estimator roots designed separately; 
that is, they are the roots of the characteristic equation [similar to Eq. (8.55)] 


|z1 — (® — FK] || zI — (® — LH®)| = 0. (11.13) 


If we allow ®, + P, and I, # T, the roots do not separate, and the system 
characteristic equation is obtained from the full 2n x 2n determinant that results 
from Eqs. (11.11) and (11.12) 


vo — liz — DK 
LH® , (I - LH)(®, — TK) — LHL,K -Iz | 


Either the pole-placement approach of Chapter 8 or the steady-state, optimal, 
discrete design method of Chapter 9 could be used to arrive at the K- and L- 
matrices. In both cases, the root sensitivity is obtained by assuming some error 
in ® or T (thus ®, 4 ®, and T, z I) and comparing the resulting roots from 
Eq. (11.14) with the ideal case of Eq. (11.13). 

If asystem has been designed using the methods of Chapter 7, root sensitivity 
is obtained by repeating the closed-loop root analysis with a perturbed plant or, 
if one parameter is particularly troublesome, an analysis of a root locus versus 
that parameter might be worthwhile. 


(11.14) 


€ Example 11.5 


Robustness vs. Sample Rate 


The equations of pitch motion of a high-performance aircraft where there is some fuselage 


bending are* 
i, = M, TM XS ab M, be 
4, =q +Z tZ ô. 
X, = 0X, 


4,2 —0,x,— 26 ox, + w, KZ, be + K,0,Z Xy, (11.15) 


4 This example is due to Katz (1974). 
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where 


x= pitch rate, 
x, = angle of attack, 
X4, X, = position and velocity of bending mode, 


ô, = elevator control surface, 


M's, Z's = aircraft stability derivatives, 
č, = bending-mode damping, 
w, = bending-mode frequency, and 
K,, K, = quantities depending on the specific aircraft shape 


and mass distribution. 


In the flight condition chosen for analysis (zero altitude, Mach 1.2), the open-loop rigid- 
body poles are located at s = —2 + j 13.5 rad/sec, and the bending mode is lightly damped 
(¢ = 0.01) with a natural frequency of 25 rad/sec (4 Hz). The control system consists of a 
sensor for X, (arate gyro), an estimator to reconstruct the remainder of the state, and a feedback 
to the elevator to be added to the pilot input. The purpose is to change the natural characteristics 
of the aircraft so that it is easier to fly, sometimes referred to as stability augmentation. 

The closed-loop poles of the rigid body were located by optimal discrete synthesis to 
s = —l6 + j10 rad/sec with essentially no change in the bending-mode root locations. The 
optimal compensator (control law plus estimator) generates a very deep and narrow notch 
filter that filters out the unwanted bending frequencies. The width of the notch filter is directly 
related to the low damping of the bending mode and the noise properties of thc system. 
Furthermore, the optimal estimator gains for the bending mode are very low, causing low 
damping in this estimation error mode. If the bending frequency of the vehicle varies from the 
assumed frequency, the components of the incoming signal to the estimator due to the bending 
miss the notch and are transmitted as a positive feedback to the elevator. 

Examine the sensitivity to errors in the bending frequency versus the sample rate. 


Solution. Figure 11.9 shows the closed-loop bending mode damping 2, as a function of the 
ratio of the sample rate w, to bending mode frequency w,. Note the insensitivity to sample 
rate when knowledge of the bending mode is perfect (no error in w,) and the strong influence 
of sample rate for the case where the bending mode has a 10% error. For this example, the 
bending mode was unstable (¢_ < 0) for all sample rates with a 10% w, error, indicating a 
very sensitive system and totally unsatisfactory design if w, is subject to some error or change 
from flight conditions. 

The sensitivity of the controlled system to errors in w, can be reduced using any one of 
many methods for sensitivity reduction, which are often referred to as robust design methods. 
One particularly simple method entails increasing the weighting term in Q, which applies to the 
bending mode. The effect of this approach is to increase the width of the notch in the controller 
frequency response, thus making the controller more tolerant to changes in the bending-mode 
frequency. Figure 11.10 shows the effect on Z, as a function of the fourth element in Q,, which 
we will refer to as the bending-mode weighting factor Q, ,,. It demonstrates a substantial 
increase in robustness. Note that it is theoretically possible to have Z, > 0.05 with a 10% w, 
error and a sample rate only 20% faster than the bending mode; however, this ignores the 
disturbance-noise amplification that will occur at w, = 2 w, discussed in the previous section. 
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Figure 11.9 k 

Closed-loop 

bending-mode damping 0.20 

of Example 11.5 versus 

Os No error in w, 
—0.25 
—0.50 10% error in c, 
—0.75 

Figure 11.10 

Closed-loop 


bending-mode damping 
versus Q, 44 for Example 
11.5 with a 1096 w, error 


With a sufficiently high value of Q, ,,, the cxample becomes "robust" in the sense that it is 
stable for a 10% c, error, but it continues to show an increasing sensitivity to the w, error for 


decreasing sample rate. 
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antialiasing filter 
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In summary, for the ideal case where the plant parameters are known exactly, 
there is no effect of sample rate on bending-mode damping or any other closed- 
loop dynamic characteristic. On the other hand, if there is some error between the 
plant parameters used for the controller design and the actual plant parameters, 
there will be an error in the desired closed-loop characteristics that increases with 
the sample period. In most cases, the use of reduced performance requirements 
or of robust design practice such as shown by the example can reduce the error 
to acceptable levels and thus does not impose extra criteria on the sample rate. 
However, sensitivity of the system to off-nominal parameters should be evaluated, 
and in some cases it might be necessary to design specifically for better robustness 
and, in rare cases, to increase the sample rate over that suggested by other 
considerations. 


Measurement Noise and Antialiasing Filters 


In addition to the random plant disturbances, w, that were evaluated in Section 
11.3, there are usually some errors in the measurement or unmodeled bending 
mode oscillations as indicated by v in Eq. (9.82). In this section, we wish to 
examine theeffect of the samplerate on the response of the system to measurement 
errors. These errors are affected significantly by the presence of analog filters 
(called antialiasing filters or prefilters), which are typically placed between 
an analog sensor and the sampler in order to reduce aliasing of high-frequency 
components of the signal. Therefore, we will discuss antialiasing filters as well. 

Figure 11.11 depicts the typical arrangement of the antialiasing filter. Digital 
control systems are arranged this way for many cases in order to prevent the 
aliasing of the higher-frequency components of v. Exceptions are those cases 
where the sensor is fundamentally digital so that there is no analog signal that 
can contain high frequency noise; for example, an optical encoder provides a 
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digitized signal directly. Antialiasing filters are low pass, and the simplest transfer 
function is? 


w 
kai P 

Haee apr (11.16) 
so that the noise above the prefilter breakpoint [w, in Eq. (11.16)] is attenuated. 
The design goal is to provide enough attenuation at half the sample rate (w, /2) 
so that the noise above w,/2, when aliased into lower frequencies by the sampler, 
will not be detrimental to the control-system performance. The basic idea of 
aliasing was discussed in Section 5.2 and shown to be capable of transforming a 
very-high-frequency noise into a frequency that is well within the bandwidth of 

the control system, thus allowing the system to respond to it. 


€ Example 11.6 


Effect of Prefiltering on Aliasing 


For a 1-Hz sine wave with a 60-Hz sine wave superimposed to represent noise, find a prefilter 
that will eliminate distortions when sampled at 28 Hz. Demonstrate the effect of the prefilter, 
and lack of one, by plotting the signals over 2 cycles. 


Solution. Figure 11.12(a) shows the 1-Hz sine wave with a 60-Hz sine wave superimposed 
to represent measurement noise. If this analog signal is sampled as is, the high-frequency noise 
will be aliased; Fig. 1 1.12(b) shows the results for sampling at w, = 28 Hz and we see that the 
60-Hz noise has been changed to a much lower frequency and appears as a distortion of the 
original sine wave. Fig. 11.12(c) shows the results of passing the noisy signal in (a) through 
a first order antialiasing filter with a breakpoint, o, = 20 rad/sec (3.2 Hz). The breakpoint 
was picked considerably below the noise frequency so there is a large attenuation of the noise; 
and yet sufficiently above the 1-Hz signal so it was not attenuated. Sampling this clean signal 
results in the faithful reproduction of the signal shown in Fig. 11.12(d). 

For an analog control system with a bandwidth on the order of 1 Hz, the 60-Hz noise 
would be too fast for the system to respond, and the noise would not be apparent on the system 
output. For a 1-Hz bandwidth digital control system without an analog prefilter, aliasing of the 
noise to a lower frequency as shown in Fig. 11.12(b) would allow the system to respond, thus 
producing sensitivity to errors that do not exist for analog controllers. Thus we see the reason 
for using antialiasing filters. 


* 


To study the effect of sensor noise on sample-rate selection, we use the 
same analysis procedures that were used for the random plant disturbances. 


5 Higher-order filters are also used in order to obtain better high-frequency attenuation with minimal 
low-frequency phase lag; common filter types are Bessel, Butterworth, and ITAE. See Franklin, 
Powell, and Emami-Naeini (2019) or Astrém and Wittenmark (1997) for details. 


Figure 11.12 
Demonstration of the 
effects of an antialiasing 
filter or prefilter, 
Example 11.6. (a) Signal 
plus noise; (b) samples 
of (a) at w, = 28 Hz; 

(C) signal in (a) passed 
through antialiasing 
filter; (d) sampling of 
signal in (c) 
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Equation (11.5) is used to determine the system output errors due to continuous- 
measurement noise acting on a continuous controller for comparison purposes, 
and then Eq. (11.6) is used to determine the degradation for digital controllers 
with various sample rates. The only difference from the cases examined in Section 
11.3 is that the noise to be examined enters on the plant output instead of on the 
plant input. A complication arises, however, because the prefilter breakpoint, c, 
is a design variable whose selection is intimately connected with the sample rate. 
It is therefore necessary to examine both quantities simultaneously. 

A conservative design procedure is to select the breakpoint and o, sufficiently 
higher than the system bandwidth so that the phase lag from the prefilter does not 
significantly alter the system stability; thus the prefilter can be ignored in the basic 
control system design. Furthermore, for a good reduction in the high-frequency 
noise at w, /2, the sample rate should be selected about five times higher than 
the prefilter breakpoint. The implication of this prefilter design procedure is that 
sample rates need to be on the order of 30 to 50 times faster than the system 
bandwidth. This kind of conservative prefilter and sample-rate design procedure 
is likely to suggest that the sample rate needs to be higher than the other factors 
discussed in this chapter. 

An alternative design procedure, due to Peled (1978), is to allow significant 
phase lag from the prefilter at the system bandwidth and thus to require that the 
control design be carried out with the analog prefilter characteristics included. 
Furthermore, the analysis procedure described above is carried out to determine 
more precisely what the effect of sampling is on the system performance. This 
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procedure allows us to use sample rates as low as 10 to 30 times the system 
bandwidth, but at the expense of increased complexity in the design procedure. 
In addition, some cases can require increased complexity in the control imple- 
mentation to maintain sufficient stability in the presence of prefilter phase lag; 
that is, the existence of the prefilter might, itself, lead to a more complex digital 
control algorithm. 


€ Example 11.7 


Measurement Noise Effects vs. Sample Rate and Prefilter Breahpoint 


Use the double integrator plant as in Examples 11.1, 11.2, and 11.3 to demonstrate the degra- 
dation of the digital system response vs. the continuous control for various sample rates and 
prefilter breakpoints. Assume that only x, is available for measurement and use an estimator 
to reconstruct the state. Use the antialiasing filter as shown in Eq. (11.16). 


Solution. The results of the digital controllers with the various prefilter breakpoints, c, , 
were all normalized to the results from a continuous controller with no prefilter. The exact 
evaluation of the integral in Eq. (11.7) was required because the sample rate was on the 
same order as c, for some sample rates studied. The continuous system was designed using 
the LQR and LQE methods of Chapter 9 so that the control roots were at s = —1.5+ j1.5 
rad/sec and the estimator roots were at s = —3.3 + j3.3 rad/sec. All the digital controllers 
had roots at the discrete equivalent of those s-plane locations by using z = e^. The results 
of the normalized rms values of the output error due to white continuous-measurement noise 
entering the system as shown in Fig. 11.11 are shown in Fig. 11.13 for four different values of 
the prefilter breakpoint. 

Note in the figure that, for sampling multiples below w,/w, = 40, the performance 
improves as the prefilter breakpoint decreases, even though it includes the case where the 
breakpoint is only twice the bandwidth. If a system is dominated by measurement noise as in 
this example and the designer chooses to limit the discrete degradation to 2096 compared to 
the continuous case, the figure shows that a sampling multiple of 


c, [oy > 25 (11.17) 


is adequate providing the prefilter breakpoint ratio w,/@, < 5, whereas a sampling multiple 
of w,/w, = 40 is required if the more conservative prefilter breakpoint ratio of w/o, = 10is 
used. An even slower sample rate would be adequate if a slower breakpoint was acceptable. 
Also note in the figure at the very fast sample rates (w/w, > 200) that the rms response 
decreased as the prefilter breakpoint increased, the opposite trend to that at the lower sample 
rates. This observation has little effect on the overall design because all values were within 
10% of the continuous case for all the fast sample rates. However, there are detrimental effects 
of prefilters that can be more serious (see Hirata, 1989), and their influence on random plant 
disturbances should also be evaluated according to Section 11.3 before a design is finalized. 


* 


Figure 11.13 

Root mean square 
response of Example 
11.7 to white sensor 
noise showing effects of 
prefiltering, @,, and 
sampling, w, 


11.6 


11.6 Multirate Sampling 469 


Discrete/continuous RMS 


100 10! 102 103 
Sampling multiple, o 1o 


In summary, there are two main conclusions. The first is that prefilters for 
removing high frequency measurement noise are effective and a methodology 
has been presented for their design. The second is that the bandwidth of the 
prefilter should be selected primarily for the reduction of sensor noise effects, 
and values approaching the system closed-loop bandwidth (w, /w, = 2) should be 
considered even though their phase lag needs to be adjusted for in the controller 
design. An exception to this would occur if the prefilter lag was sufficient to 
require a more complicated controller, because this increase in computation 
could offset any cost gains from slower sampling. 


Multirate Sampling 


In multi-input, multi-output (MIMO) systems, it is sometimes useful to use differ- 
ent sample rates on the various measurements and control variables. For example, 
in the tape-drive design in Chapter 9, one could logically sample the tension mea- 
surement, 7, in Fig. 9.18, faster than the tape position measurement, x,, because 
the tension results from faster natural dynamics than the tape positioning. For 
MIMO systems with significant time constant differences in some natural modes 
or control loops, improvements in performance can be realized by sampling at 
different rates; such a system is referred to as a multirate (MR) system. In this 
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successive loop closure 


section we will address the issues of analysis of an MR system and selection of 
the sample rates. 

The z-transform method that is presented in Chapter 4 and is the basis for 
the analysis of discrete systems throughout this book does not apply to MR 
systems. Considerable research on the subject was carried out in the 1950s and 
resulted in the “switch decomposition" method, which has been reviewed by 
Ragazzini and Franklin (1958) and Amit (1980) and extended to multivariable 
systems by Whitbeck and Didaleusky (1980). The key idea in this method is that 
it is possible to analyze an MR system by reducing it to an equivalent single- 
rate system operating at the longest sample period of the system. The “Kranc” 
operator’ facilitates this transformation. 

Another approach uses a state-space description due to Kalman and Bertram 
(1959). The difference equations have coefficients in ® and F that change pe- 
riodically at the longest sample period. An MR analysis procedure with this 
approach is contained in Berg, Amit, and Powell (1988) and, similarly to switch 
decomposition, reduces the MR system to an equivalent single-rate system where 
conventional methods can be employed. The computational burden for both the 
switch-decomposition and the state-space MR analysis is substantial; therefore, 
digital control designers typically opt for design methods during the synthesis 
process which bypass the need for using one of the exact MR analysis tech- 
niques. 

An analysis method often used is called successive loop closure (SLC). 
It requires that the system initially be decoupled into two or more single-rate 
subsystems along the lines discussed in Section 9.1 and as shown in Fig. 11.14. 
The dashed lines in the figure show the cross-coupling between G , (s) and G,(s), 
which is ignored in the decoupling process. The controller for the subsystem 
with the highest frequencies (and fastest sample rate) is designed first with the 
sample rate selected according to the ideas in this chapter. This digitally controlled 
subsystem is then converted back to a continuous system (see d2c.m in MATLAB), 
which makes it possible to meld it into the slow portion of the plant. This 
transformation is the inverse of Eqs. (4.41) or (4.58) and is exact in the sense that 
the continuous model outputs will match the discrete system at the sample times. 
A discrete representation at the slow sample rate of the entire plant, including the 
previously ignored cross-coupling and the fast controller, can now be obtained 
using conventional techniques. This enables design of a digital controller for the 
slow loop using the resulting discrete model. This model is exact provided the 
fast sample rate is an integer multiple of the slow rate. 

The approximation made during the design of the inner (fast) loop is the 
elimination of the cross-coupling; however, the model of the system at the slow 
sample rate will accurately predict any inner-loop root migration due to the 
approximation and can be corrected by a second design iteration, if needed. 


6 See Kranc (1957). 


Figure 11.14 

Block diagram of a 
MIMO system showing 
methodology for 
multirate analysis using 
successive loop closure 


11.6 Multirate Sampling 471 


Slow controller 


Therefore, we see that the final design of the MR system should perform according 
to that predicted by linear analysis. The effect of the approximations made for 
design of the inner loop serve only to produce an inner-loop controller that may be 
somewhat inferior to that which could be obtained with an accurate MR analysis 
and optimal design of the entire system together as described by Berg, Amit, and 
Powell (1988). 

For systems with three or more identifiable loops where it is desirable to have 
three or more different sample rates, the same ideas are invoked. The innermost 
(fastest) loop is closed first, transformed to a continuous model followed by 
transformation to a discrete model at the next slower sample rate, and so on until 
the final (slowest) loop is closed, encompassing the entire system. 

The sample-rate selection for an MR system designed using SLC is carried 
out using the considerations discussed in Sections 11.1 through 11.5. Note that 
this does not imply that the same sampling multiple, c, /w,, be used for all loops. 
The appropriate sampling multiple is a function of the plant disturbance and 
sensor noise acting on that portion of the system which may be quite different for 
the different loops. Ignoring noise effects leads one to pick the same sampling 
multiple for all loops; that is, the sample rates for each loop are in the same 
proportion as the bandwidth for each loop. 
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€ Example 11.8 


Figure 11.15 

MIMO double 
mass-spring system used 
in Example 11.8 


Advantage of MR Sampling for the Double Mass-Spring System 


Investigate the merits of multirate sampling in terms of rms response to plant noise for the 
double mass-spring system.’ Compare the rms response of an MR controller with that of a 
single rate (SR) controller that uses the same amount of computation effort. 


Solution. The double mass-spring system with control applied to both masses is shown 
in Fig. 11.15. We have defined x, as the absolute position of the large mass, whereas x, is 
the position of the small mass with respect to the large mass. The inner loop consists of the 
measurement of x, and its feedback through a controller to u,. The effect of the coupling from 
x, motion is ignored; so the dynamics are that of a simple mass-spring system. The inner (fast) 
loop was designed using pole-placement for full state feedback so that its open-loop resonant 
frequency of 4 Hz was changed toa closed-loop frequency of 8 Hz, and the damping improved 
from £ = 0 to  =0.7. The sample rate was chosen to be 71 Hz, nine times faster than the 
closed-loop root. 

A continuous model of this fast inner loop was then obtained via d2c.m in MATLAB and 
added to the dynamics of the large mass, including the coupling between the two masses. The 
entire system was then discretized at the slow rate. The outer loop was designed using feedback 
from the slow sampled states so that the rigid-body open-loop roots at s = 0 were changed to 
1 Hz with a damping of ¢ — 0.7. The sample rate for this loop was chosen to be 9 Hz, also 
nine times the closed-loop root. The effect of the dynamics of the large mass and its coupling 
to the small mass along with the outer-loop feedback caused an insignificant movement of the 
closed-loop inner roots. If the inner roots had moved an unacceptable amount, it would have 
been necessary to return to the inner-loop design and revise the design so that it resulted in 
acceptable dynamics after the second loop closure. 

For evaluation of the MR controller, its performance in the presence of plant disturbance 
noise, w, applied to the small mass as shown in Fig. 11.15 will be compared against the 
performance of a single-rate controller. The sample rate of the single-rate controller was 
selected to be 40 Hz, a value that required approximately the same amount of computer 
computation time as the MR controller, thus yielding a fair comparison. Fig. 11.16 shows the 
relative rms responses for the two cases and generally shows the superiority of the MR case in 
that its small mass response was approximately 3096 less than the single-rate case, whereas the 
large mass response was approximately the same. One could say that, for this configuration, 
it pays to assign a large fraction of the computer power on the control of the small mass by 


7 This example is from Berg, Amit, and Powell (1988). 
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Figure 11.16 

Relative rms state and 
control responses for 
Example 11.8 with w 
applied to the small mass 


RMS response with MR controller 
RMS response with SR controller 


Xy Xp A y y 
State or control variable 


employing a faster sample rate for the small mass feedback. This follows because the bandwidth 
of the inner loop was faster and the plant disturbance noise was applied to this mass. 

To analyze the situation further, let us change the plant disturbance noise configuration 
from the small mass to the large mass and keep the controller described above. The same 
comparison of relative rms responses is shown in Fig. 11.17 and indicates a different con- 
clusion. The degradation of the large mass response with the MR controller is more than the 
improvement of the small mass; therefore, one could conclude that the single-rate controller 
was superior in this configuration. The reason is that the noise applied to the large mass (slow 


Figure 11.17 

Relative rms state and 
control responses for 
Example 11.8 with w 
applied to the large mass 


RMS response with SR controller 


RMS response with MR controller 


x, n X3 X2 ui Uy 


State or control variable 
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mode) produced a situation where the overall system response was improved by shifting more 
computer power to the slow mode compared to the previous configuration. The net result was 
that an equal sample rate on the two loops was best in spite of the fact that the roots of the two 
loops differed by a ratio of 8:1. 

The significant benefit of the MR sampling shown in Fig. 11.16 is partially due to the 
fact that both sample rates were relatively slow, that is, nine times their respective closed-loop 
root. Had the sample rates been faster, the advantage would have been less pronounced. The 
reason for this is that the effect of disturbance noise versus sample rate is basically a quadratic 
function, with the errors growing quickly as sampling multiples fall below about 20. If all rates 
had been faster, the system performance would be less sensitive to both sample rates and thus 
less sensitive to the allocation of computer power to the fast or slow loop. 


11.7 


* 


In summary, we see that multirate sampling allows for the assignment of 
available computer power to the control modes that can most benefit, thus im- 
proving system response. The design of multirate systems can usually be carried 
out using a successive loop closure method that employs analytical techniques, 
which are considerably simpler than exact MR techniques. The resulting design 
is typically acceptable. 

Multirate sampling is not necessary in the majority of digital control systems. 
In the usual case, sufficient computer speed is available so that one sample rate 
based primarily on the fastest mode can be selected even though one or more of 
the slower modes will be sampled faster than necessary. This is a better overall 
design because it avoids the complication of multirate sampling, even though it 
could be claimed that computer power is being wasted on some of the slower 
modes. 


Summary 


* For a control system with a specified closed-loop bandwidth, w,, the theo- 
retical lower bound on the sample rate to bandwidth multiple is 
w/w, > 2. 


Other considerations usually dictate that the sample rate be considerably 
faster than 2 times the closed-loop bandwidth. 


e [n order to provide a “reasonably smooth” control response, a sampling 
multiple of 


c/o, > 20 


is often judged desirable. This is a subjective issue and depends heavily on 
the specific application, with slower rates often acceptable. 
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* There is potentially a full sample period delay between command input and 
the system response. In order to limit the possible delay to be 10% of the rise 
time, a sampling multiple of 


w/w, > 20 


is required. 

e The primary purpose of many feedback control systems is to limit the re- 
sponse to random plant disturbances. In comparison to a continuous control 
system with the same equivalent s-plane roots, a digital control system de- 
grades as the sample rate decreases. This degradation is shown by the bottom 
curve in Fig. 11.6 for a double integrator plant. In order to keep the degra- 
dation to be within 20% of a continuous controller, a sampling multiple 
of 


w/w, > 20 


is required. 
e For a system with a resonance at w, that is being damped or stabilized by the 
controller, it is prudent to select a sample rate 


w > 2w. 
S r 


e For a system with a resonance at w, that is being damped or stabilized by the 
controller, it may be useful for robustness purposes to sample considerably 
faster than 2w,. 


* Analog prefilters or antialiasing filters are typically required to attenuate 
the effect of measurement noise at frequencies greater than c, /2. 


e Selection of the prefilter breakpoint, On is acomplex design step that involves 
interactions with the sample rate and control system bandwidth. Generally, 
the further the breakpoint is below w,/2, the better the measurement noise 
attenuation. It is also convenient to maintain the prefilter breakpoint above 
the control system bandwidth; however, this is of lessor importance. The 
tradeoffs for a double integrator plant are depicted in Fig. 11.13. It shows 
that a sampling multiple of 


w/w, > 25 


is required for a breakpoint ratio of w,/w, = 5 if the designer wishes to 
limit the discrete noise degradation to 2096 over that from the continuous 
case. Faster antialiasing filters require faster sample rates. Slower antialias- 
ing filters reduce the effect of measurement noise, but more phase lag is 
introduced into the feedback and, if excessive, may necessitate extra lead in 
the controller. 


e When using a small word size microprocessor (e.g., 8 bits), sampling multi- 
ples greater than 50 were shown in Fig. 11.6 to be counterproductive. 
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11.8 


The performance of a digital control system can sometimes be enhanced 
by using more than one sample rate for different loops in the system. The 
z-transform does not apply directly to this case and a successive loop closure 
design method for such systems was described in Section 11.6. 

The cost of a digital control system increases with sample rate due to the 
microprocessor cost and A/D cost. Therefore, for high volume products 
where the unit hardware costs dominate, picking the slowest sample rate 
that meets the requirements is the logical choice. For a low volume product, 
the labor cost to design the controller far outweighs the extra cost of a fast 
computer; therefore the logical choice of the sampling multiple is on the 
order of 40 or more so that the extra complexity of digital design methods is 
not required. 


Problems 


11.1 


11.2 


11.3 


11.4 


Find discrete compensation for a plant given by 

0.8 
s(s 4- 0.8) 
Use s-plane design techniques followed by the pole-zero mapping methods of Chapter 
6 to obtain a discrete equivalent. Select compensation so that the resulting undamped 
natural frequency (w,) is 4 rad/sec and the damping ratio (2) is 0.5. Do the design 
for two sample periods: 100 ms and 600 ms. Now use an exact z-plane analysis of 
the resulting discrete systems and examine the resulting root locations. What do you 
conclude concerning the applicability of these design techniques versus sample rate? 


G(s) = 


The satellite attitude-control transfer function (Appendix A) is 

T^(z +1) 

2(2 - 1)?” 

Determine the lead network (pole, zero, and gain) that yields dominant poles at = 0.5 


and w, = 2 rad/sec for w, = 1, 2, and 4 Hz. Plot the control and output time histories 
for the first 5 sec and compare for the different sample rates. 


G) = 


Consider the satellite design problem of Appendix A. 


(a) Design full state feedback controllers with both roots at z = 0 for T = 0.01, 0.1, 
and | sec. 


(b) Plot the closed-loop magnitude frequency response of 0 for the three sample rates 
and determine the closed-loop bandwidth for each case. 


(c) Plot the step response for the three sample rates and determine the rise time. 
Compare these values against that predicted by the relation in Section 2.1.7, 
tZ 1.8/o,. 

Following the satellite design problem used for Example 11.3, evaluate the rms errors 

versus sample rate due to plant disturbance noise, using the case where only the attitude, 

6, is measured and used by an estimator to reconstruct the state for control. Base all 

discrete control gains on z-plane roots that are equivalent to = 0.7 and w, = 3 rad/sec 

and estimator gains on z-plane roots equivalent tog = 0.7 and c, = 6 rad/sec. Assume 


11.5 


11.6 


11.7 


11.8 


11.9 


11.10 For your design in Problem 11.9, evaluate the steady state rms noise response of x 
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the torque disturbance has a power spectral density of R wa = 1 N?-m?/sec as was 
done for Example 11.3. Plot the error for c, = 6, 10, 20, 46, 80, and 200 rad/sec and 
comment on the selection of 7 for disturbance rejection in this case. 

In Appendix A, Section A.4, we discuss a plant consisting of two coupled masses. 
Assume M = 20 kg, m= 1 kg, k = 144 N/m, b = 0.5 N-sec/m, and T = 0.15 sec. 


(a) Design an optimal discrete control for this system with quadratic cost J = y? + u?. 
Plot the resulting step response. 


(b) Design an optimal discrete filter based on measurements of d with R, = 0.1 m? 
and a disturbing noise in the form of a force on the main mass M with R = 0.001 
N? E 

(c) Combine the control and estimation into a controller and plot its frequency response 
from d tou. 


(d) Simulate the response of the system designed above to a step input command 
and, leaving the controller coefficients unchanged, vary the value of the damping 
coefficient b in the plant. Qualitatively note the sensitivity of the design to this 
parameter. 

(e) Repeat steps (a) to (d) for T — 0.25 sec. 

For each design in Problem 11.2, increase the gain by 2096 and determine the change in 

the system damping. 

Consider a plant consisting of a diverging exponential, that is, 

x(s) ET 
uls) s—a 


Controlled discretely with a ZOH, this yields a difference equation, namely, 
x(k + 1) = e" x(k) +e? — 1]u(k). 
Assume proportional feedback, 


u(k) = —K x(k), 


and compute the gain K that yields a z-plane root at z = e^". Assume a = | sec™!, 


b = 2 sec™!, and do the problem for T = 0.1, 1.0, 2, 5 sec. Is there an upper limit on 
the sample period that will stabilize this system? Compute the percent error in K that 
will result in an unstable system for T — 2 and 5 sec. Do you judge that the case when 
T — 5 sec is practical? Defend your answer. 


In the disk drive servo design (Problem 9.11 in Chapter 9), the sample rate is often 
selected to be twice the resonance (w, = 6 kHz in this case) so that the notch at the 
Nyquist rate (w,/2) coincides with the resonance in order to alleviate the destabilizing 
effect of the resonance. Under what conditions would this be a bad idea? 


For the MIMO system shown in Fig. 11.15, let k = 10 N/m. Design an MR sampling 
controller where the fast sampling rate on the x, to «u, loop is 10 Hz (T = 100 msec) and 
the slow sampling on the x, to u, loop is 2 Hz (T = 500 msec). Pick the fast poles at 1.5 
Hz with ¢ = 0.4 and the slow poles at 0.3 Hz with = 0.7. 


and 
1 
x, for an rms value of w = 0.1 N. Repeat the design with the same pole locations, for 
two cases: 


(a) fast sample period T,,,, = 50 msec, and slow sample period 7,,,,, = 500 msec, 


fast 


low 
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(b) fast sample period Trasi = 100 msec, and slow sample period 7 pọ = 250 msec, 
and evaluate the rms response in both cases. If extra computational capability 
existed, where should it be used? In other words, which sample rate should be 
increased? 


11.11 Replot the information in the lower curve in Fig. 11.6 (that is, no quantization noise) 
for three cases: (a) process noisc and no measurement noise (the case already plotted), 
(b) process noise and measurement noise with an rms = 1, and (c) measurement noise 
with an rms = | and no process noise. Use exactly the same controller and estimator 
as the current plot. Note that FIG116.M assumes R, = 1 for purposes of computing the 
continuous estimator gains; however, in computing the rms response, no measurement 
noise was included. 


 12- 


System Identification 


A Perspective on System Identification 


In order to design controls for a dynamic system it is necessary to have a model 
that will adequately describe the system's motion. The information available 
to the designer for this purpose is typically of two kinds. First, there is the 
knowledge of physics, chemistry, biology, and the other sciences which have 
over the years developed equations of motion to explain the dynamic response of 
rigid and flexible bodies, electric circuits and motors, fluids, chemical reactions, 
and many other constituents of systems to be controlled. However, it is often 
the case that for extremely complex physical phenomena the laws of science 
are not adequate to give a satisfactory description of the dynamic plant that we 
wish to control. Examples include the force on a moving airplane caused by 
a control surface mounted on a wing and the heat of combustion of a fossil 
fuel of uncertain composition. In these circumstances, the designer turns to data 
taken from experiments directly conducted to excite the plant and measure its 
response. The process of constructing models from experimental data is called 
system identification. In identifying models for control, our motivation is very 
different from that of modeling as practiced in the sciences. In science, one seeks 
to develop models of nature as it is; in control one seeks to develop models of 
the plant dynamics that will be adequate for the design of a controller that will 
cause the actual dynamics to be stable and to give good perfocmance. 

The initial design of a control system typically considers a small signal anal- 
ysis and is based on models that are linear and time-invariant. Having accepted 
that the model is to be linear, we still must choose between several alternative 
descriptions of linear systems. If we examine the design methods described in 
the earlier chapters, we find that the required plant models may be grouped in 
two categories: parametric and nonparametric. For design via root locus or pole 
assignment, we require a parametric description such as a transfer function or a 
state-variable description from which we can obtain the poles and zeros of the 
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parametric model 


nonparametric model 


plant. These equivalent models are completely described by the numbers that 
specify the coefficients of the polynomials, the elements of the state-description 
matrices, or the numbers that specify the poles and zeros. In either case we call 
these numbers the parameters of the model, and the category of such models is 
a parametric description of the plant model. 

In contrast to parametric models, the frequency-response methods of Nyquist, 
Bode, and Nichols require the curves of amplitude and phase of the transfer 
function G(e/^") = ¥(jw)/U(jw) as functions of c. Clearly, if we happen 
to have a parametric description of the system, we can compute the transfer 
function and the corresponding frequency response. However if we are given 
the frequency response or its inverse transform, the impulse response, without 
parameters (perhaps obtained from experimental data) we have all we need to 
design a lead, lag, notch, or other compensation to achieve a desired bandwidth, 
phase margin, or other frequency response performance objective without ever 
knowing what the parameters are. We call the functional curves of G(e/®’) a 
nonparametric model because in principle there is no finite set of numbers that 
describes it exactly. 

In addition to the selection of the type of model desired, one must define 
the error to be minimized by the model estimate. For transfer functions, if we 
assume the true value is G(e/^") and the model is given by G(e/”7), then one 
possible measure is f (G(e/*") — G(e/*"))*do. Clearly other measures are also 
possible and can be considered. For discrete-time parametric models a standard 
form assumes that the output is generated by the discrete equation 


a(z)y = b(z)u + c(z)v, (12.1) 


where a, b, and c are polynomials in the shift operator! z, y is the output, u is the 
input, and v is the error and assumed to be a random process independent of u. If 
c(z) = a(z), then v is called the output error, for in that case Eq. (12.1) becomes 
y = [b(z)/a(z)] u + v, and v is noise added to the output of the system. If e(z) 
= | then v is called the equation error, for it is the error by which the data y and 
u fail to satisfy the equation given by a(z)y = b(z)u. Finally, if c(z) is a general 
polynomial, then Eq. (12.1) is equivalent to a Kalman filter and v is called the 
prediction error. 

The data used in system identification consist mainly of the records of input, 
u(k), and output, y(k) and assumptions about the error. However, in addition 
to these records there is usually very important information about the physical 
plant that must be used to get the best results. For example, the plant transfer 
function may have a known number of poles at z = | corresponding to rigid 
body motion. If there are such known components, the data can be filtered so 
that the identification needs only estimate the unknown part. Also typically it 


1 Some authors use the variable q as the shift operator to avoid confusion with z as the variable of the Z 
transform. It is expected that the rare usc of z as an operator will not cause confusion here. 
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is known that the important features of the transfer function are contained in 
a known frequency range and that data outside that range is only noise. It is 
important to filter the raw data to remove these components in so far as possible 
without distorting important features to prevent the algorithm from distorting the 
estimated model in a futile effort to explain this noise. The important conclusion 
is that all a priori information be used fully to get the best results from the 
computations. 

In summary, identification is the process by which a model is constructed 
from prior knowledge plus experimental data and includes four basic compo- 
nents: the data, the model set, the criterion by which one proposed model is to 
be compared with another and the process of validation by which the model is 
confirmed to mimic the plant on data not used to construct the model. Because 


“of the large data records necessary to obtain effective models and the complex- 


ity of many of the algorithms used, the use of computer aids is essential in 
identification. Developments such as the MATLAB System Identification Tool- 
box are enormous aids to the practical use of the techniques described in this 
chapter. 


Chapter Overview 


In this chapter, we consider several of the most common and effective approaches 
to identification of linear models. In Section 1 we consider the development of 
linear models as used in identification. In Section 2 the construction of non- 
parametric models is considered. Data taken by single sinusoidal signal inputs 
as well as data in response to a complex signal are considered. Also presented 
is the concept of including prior knowledge of the plant response, including 
the possibility of known pole locations for rigid body cases. In Section 3 tech- 
niques for identification of parametric linear models based on equation error 
are introduced with the selection of parameters and definition of the error. Al- 
gorithms for batch least squares are given in Section 4, recursive least squares 
in Section 5, and stochastic least squares in Section 6. It is shown that bias is 
encountered when the equation error is not a white noise process in each of the 
techniques. In Section 7 the method of maximum likelihood is introduced as a 
method that can overcome the bias considering the prediction error. Finally, in 
Section 8 the estimation of state description matrices based on subspace methods 
is described. 


Defining the Model Set for Linear Systems 


Formally, one proceeds as follows with the process of linearization and small- 
signal approximations. We begin with the assumption that our plant dynamics are 
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adequately described by a set of ordinary differential equations in state-variable 
form as discussed in Section 4.3.6 


dca Rh, cet eee. E) 
Rye feb in opp necs a 
t = fom tatem xh 
P= h Qs ctt og us ctu. EJ 
y, =A 5, X, 44, Ups t), (12.2) 
or, more compactly in matrix notation, we assume that our plant dynamics are 
described by 
x — f(x, u, 1), X(t) =X, 
y = h(x, u, r). (12.3) 


The assumption of stationarity is here reflected by the approximation that f 
and h do not change significantly from their initial values at 1. Thus we can set 
X = f(x, u, 1) 

or, simply, 
X — f(x, u), 
y = h(x, u). (12.4) 
The assumption of small signals can be reflected by taking x and u to be 


always close to their reference values x,, Up, and these values, furthermore, to be 
a solution of Eq. (12.4), as 


X, = f(X,, Uy) (12.5) 


Now, if x and u are close to x, and Up they can be written as x = X, + 
óx; u= u, + du, and these can be substituted into Eq. (12.4). The fact that ôx 
and ôu are small is now used to motivate an expansion of Eq. (12.4) about x, and 
u, and to suggest that only the terms in the first power of the small quantities 
óx and óu need to be retained. We thus have a vector equation and need the 
expansion of a vector-valued function of a vector variable 


d 
a 09 + 99 = f(x, + ôx, u, + ôu). (12.6) 


If we go back to Eq. (12.2) and do the expansion of the components f. one at a 
time, it is tedious but simple to verify that Eq. (12.6 ) can be written as 


X, +6x = f(x), uy) + f, (x,, uj)óx + f, (xy, Uy)du + --- (12.7) 
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where we define the partial derivative of a scalar f, with respect to the vector x 


by a subscript notation 
^ 9f, 9f, 
=(— © IJ. 12.8 
fis e Ox, Ve) 
The row vector in Eq. (12.8) is called the gradient of the scalar f, with respect 
to the vector x. If f is a vector, we define its partial derivatives with respect to the 
vector x as the matrix (called the Jacobean) composed of rows of gradients. In 
the subscript notation, if we mean to take the partial of all components, we use 
the bold vector as the subscript 
f, 
Ox 
0f, 


9f, 

ox 
Now, to return to Eq. (12.7), we note that by Eq. (12.5) we chose x,, u, to be a 
solution so the first terms of Eq. ( 12.7) balance, and, because the terms beyond 
those shown depend on higher powers of the small signals ôx and du, we are led 
to the approximation 


dx 2c, f, (X,, u)óx + f, (Xp, U,)dU 
ôy = h ôx +h, ôu. (12.10) 


Now the notation is overly clumsy and we drop the ô-parts and define the constant 
matrices 


F f(x,uj), G = f, (Xo uy) 
H = h(x,u) J= h, (Xp uy) 


to obtain the form we have used in earlier chapters 


x = Fx + Gu, y = Hx + Ju. (12.11) 


We will go even further in this chapter and restrict ourselves to the case of single 
input and single output and discrete time. We write the model as 


x(k + 1) = Px(k) + Pu(k), 


y(k) = Hx(k) + Ju(k), (12.12) 
from which the transfer function is 
G(z) = Y(z)/U(z) = H(zY - )^ T + J, (12.13) 


and the unit pulse response is 


g02J7  g(k)=H 7T k=1,... (12.14) 
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12:2 


identification, 
one-frequency-at-a-time 


The System Identification Toolbox includes the multivariable case, and multivari- 
able theory is covered in a number of advanced texts devoted to identification, 
including Ljung (1987). 


Identification of Nonparametric Models 


One of the important reasons that design by frequency-response methods is so 
widely used is that it is so often feasible to obtain the frequency response from 
experimental data. Furthermore, reliable experimental estimates of the frequency 
response can be obtained when noise is present and when the system is weakly 
nonlinear. Models based on such frequency-response data can then be used ef- 
fectively in control system design using, for example, the methods of Bode. The 
presence of nonlinearity leads to the concept of the “describing function,” a topic 
that will be considered in Chapter 13. Here we will introduce and illustrate ex- 
perimental measurement of the frequency response in the case of linear, constant, 
stable models, including the possibility of unmeasured noise inputs to the system. 
The situation to be considered is described by the transform equations 


Y(z) = G(z)U (2) + H(z) W(z). (12.15) 


In this equation, Y is the plant output, U is the known plant control input, and 
W is the unmeasured noise (which might or might not be random). G is thus the 
plant transfer function and H is the unknown but stable transfer function from 
the noise to the system output. The frequency response of this system, as was 
discussed in Chapter 2, is the evaluation of G(z) for z on the unit circle. If the 
noise is zero and the input u(k) is a sinusoid of amplitude A and frequency w, 
then 4(K) is given by 


u(kT) = A sin(@, kT), (12.16) 

and the output samples, in the steady state, can be described by 
y(kT) = B sin(w kT T3: (12.17) 

where 
B/A = |G(e^7)| 

$, = Z G(e*v?). (12.18) 
Given the situation described by Eq. (12.15), there are two basic schemes 
for obtaining a nonparametric estimate of G(e/""). The first of these can be 
described as the one-frequency-at-a-time method and the other as the spectral 
estimate method. We will describe these in turn. In each case we will require the 
computation of Fourier transforms using finite records of data; and for this the 


discrete Fourier transform (DFT) described in Chapter 4 is used, especially in its 
computationally optimized form, the fast Fourier transform (FFT). 
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The Method of One Frequency at a Time 


We assume that we have a system as described in Eq. (12.15) and that the input is 
a sinusoid of the form of Eq. (12.16). The input and the output are recorded for N 
samples, and we wish to find the amplitude B and the phase $,, from which one 
point on the frequency response can be computed according to Eq. (12.18). We 
then increment the frequency to w, = w, + dw and repeat until the entire range of 
frequencies of interest is covered with the desired resolution. The accuracy of the 
estimate at any one point is a function of the number of points taken, the relative 
intensity of the noise, and the extent to which system transients have been allowed 
to decay before the recordings are started. We will describe the calculations in an 
ideal environment and discuss briefly the consequences of other situations. 

Consider an output of the form of Eq. (12.17) but with unknown amplitude 
and phase. Recognizing that the experimental data might reflect some nonlinear 
effects and contain some noise component, a reasonable computational approach 
is to find that amplitude B and phase $, which best fit the given data. We define 
the estimate of the output as 


$(kT) = Bsin(w,kT + d) 
= B.cos(@ kT) + B, sin(w)kT), 


where B, = B sin($,) and B, = B cos($,). Once we have computed B, and B, 
we can compute the values of B and @, as 


B= y (B? + Bb), 


B 
$, = arctan z (12.19) 


s 


To find the best fit of y to the data, we form the quadratic cost in the error between 
y and y as 


1 N-1 : 
m XORT) — (kT)? 
k=0 
] N-1 
TM 2 y0 T) — B_cos(w kT) — B,sin(w,kT))’. 


Keep in mind that y(kT) is the measured output and y(kT) is the computed 
estimate of the output. We wish to compute B, and B, so as to make J as small 
as possible. To do this, we set the derivative of J with respect to B, and B, equal 
to zero. Consider B, first 


N-1 


Y 20r) — B, cos(o,kT) — B, sin(w,kT))(—cos(w,kT)) = 0. 
k=0 


8r 317 
dB. NE 
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If we use elementary trigonometric identities for the product of sine and cosine, 
and collect terms of the sum, this expression is equivalent to 


N-1 


l Bi. D ee 
— y(kT) cos(w,kT) — — — — cos(2w,kT) 


N-1 
27 y =0. (12.20) 
Generally, we would expect the last two terms in Eq. (12.20) to be small because 
the sinusoids alternate in sign and have zero average value. They will be exactly 
zero if we select the test frequencies carefully. Consider the following identity 
(which also played an important role in the development of the DFT in Chapter 4) 


pen fY 
-k 
— ) ZZ = —————. 
N rmm Nd -z") 


If we select integers £ and N and let z = e/**"/", corresponding to w = 272/N, 


then 
1 Y eintuim e i> e 
N er l Ar e ("UN 
l, £2rN 
| O elsewhere (12-21) 


If we substitute Euler's formula, e7/?"/" = cos(2m&k/N) — j sin(2zt £k/ N), 
into the left side of Eq. (12.21) and equate real and imaginary parts, we find that 


l N-I 


N 


l 4-—sN 


cos(2z £k/ N) = | 0 elsewhere 


k=0 
and also 


l N-1 
— ) sin2z£k/N) = 0. 
N k=0 


If we now select our test frequencies to be w, = 27 £/NT, for integer €, then 
Eq. (12.20) reduces to 


2 N-1 
B = " E y(kT) cos(2z £k/ N). (12.22) 
In similar fashion, for this choice of input frequency, one can show that 
2 N-1 
B= E v» y(kT) sin(2z £k/N). (12.23) 
k=0 


Thus Eq. (12.22) and Eq. (12.23) give us formulas for computing the values of 
B, and B,, and Eq. (12.19) gives us the formulas for the gain and phase that 
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describe the output data y(k7). The transfer function from which the output was 
measured is then given by Eq. ( 12.18). 

Equation (12.22) is closely related to the DFT of y derived in Section 4.5. 
In fact, if we assume that y(kT) is a sinusoid of frequency w, = 2: €/NT and 
consider the DFT of y 


N-1 
DFT[y(kT)] = Spanien" 
yai 
= Y y(kT)[cosQzkn/N) — j sin(2zkn/N)] 
k=0 
3[B,-jBl n=e 
0 — 


Likewise, the DFT of the sinusoidal input of the same frequency is given by 


N-I 
DFT[u(kT)] = J  AsinQz£k/ Nye FIN 
k=0 
N-1 
= 2; TEL = g PRIN o Poeni] 
k=0 
NA 
= Saget (n = £) 
2j 
= 0, elsewhere. (12.24) 


Thus, for n = £, the ratio of the DFT of y to that of u is 


DFT() _(N/2)(B, - jB) 
DFT(u) NAJ2j 
a B, "E IB, 
EN 
= |Gle’%, (12.25) 


which is the result found in Section 4.5. Of course, using the DFT or the FFT 
on y(kT) and u(kT) just for one point on the transfer function is unnecessarily 
complicated, but because the DFT automatically computes the best fit of a sinu- 
soid to the output in the least-squares sense, the result is very accurate and, with 
the use of the FFT, it is fast. 

It is possible to take advantage of Eq. (12.25) to evaluate the transfer function 
at several frequencies at once by using an input that is nonzero over a band of 
frequencies and computing the ratio of output to input at each nonzero value. 
To illustrate this technique, we will use an input known as a chirp, which is a 
modulated sinusoidal function with a frequency that grows from an initial value 
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to a final value so as to cover the desired range. An example of a chirp is given 
by the formulas 


u(kT) = A, + w(k)sin(w,kT),  O<k<N-1, 
w(k) = A sat(k/0.1 N)sat((N — k)/0.1N), 


k start 


k 
Wy = Gu + 57 (Pana 7 a (12.26) 


In this expression, w(k) is a weighting or window function that causes the input 
to start at zero, ramp up to amplitude A, and ramp down to zero at the end. The 
"sat" is the saturation function, which is linear from —1 to +1 and saturates at 
—] for arguments less than —1 and at +1 for arguments greater than +1. The 
constant A, is added to make the input have zero average value. 


€ Example 12.1 


A Chirp Signal 


Plot a chirp of maximum amplitude 1.0, starting frequency wT = 0.1 rad and ending frequency 
wT = 0.75 rad. Use 256 points. Also plot the magnitude of the transform of the signal. 


Solution. From Eq. (12.26) we have A = 1, TO ar, = 9-1, To, = 0.75, and N = 256. 
Theconstant A, was computed as the negative of the average of the basic signal, using MATLAB. 
The transform was computed using fft.m. The results are plotted in Fig. 12.1, parts (a) and (b). 
Notice that the actual spectrum is very flat between 0.3 and 1.0 rad and has non-negligible 
energy over the wider range between 0.1 and 1.5 rad. The energy above Tw, = 0.75 rad is 
the result of the fact that the chirp is really a frequency modulated signal and many sidebands 


are present. 


* 


€ Example 12.2 


Computing a Transfer Function 


Apply the chirp signal of Example 12.1 to a system obtained as the zero order hold equivalent 
of 1/(s? +0.25s +1) with T = 0.5. Compute the estimate of the transfer function using 
Eq. (12.25). 


Solution. The results of the application of Eq. (12.25) to this data are plotted in Fig. 12.1, 
parts (c) and (d). The true frequency response is plotted as a solid line and the experimental 
transfer function estimate is plotted as x's on the same graphs. As can be seen, the estimate 
is a very accurate measure for frequencies up to 1.8 rad, after which the estimate deviates 
badly from the true magnitude and phase curves. This deviation coincides with the frequencies 
where the input has little energy and leads us to conclude that one should not trust the result of 
Eq. (12.25) unless the input has significant energy at the computed frequency. 


* 


Figure 12.1 

Estimation of transfer 
function using a chirp 
signal: (a) the chirp, 

(b) the chirp's spectrum, 
(c) the transfer function 
and its magnitude 
estimate, (d) the true 
and estimated phase 
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Data Conditioning 


As illustrated by Example 12.1, a signal can be constructed to have energy only in 
a certain band, and with its use the transfer function can be accurately estimated 
in this band. This observation is a particular case of an extremely important 
principle in system identification: 


Use all available prior knowledge 
to reduce the uncertainty in the estimate. 


In the case of Example 12.1, we were able to use the fact that the important 
features of the unknown transfer function are in the digital frequency range 
0.1 x wT x 1.5 to design a chirp signal that would provide energy in that range 
only. Another situation, which was developed by Clary (1984), occurs when 
part of the transfer function is already known. This is common in the control of 
mechanical devices, such as robot mechanisms or space structures, where one 
knows that the transfer function has two poles at the origin in the s-plane and 
thus two poles at z — 1 in the z-plane. We may also know how fast the transfer 
function goes to zero for high frequencies and from this knowledge assign zeros 
of the transfer function at z = — 1 to correspond to these. This assumption is not 
exact, because zeros at infinity in the s-plane do not map to any fixed point in the 
z-plane; however, in almost all cases, they result in z-plane zeros that are often 
well outside the unit circle so that they have almost no influence on the frequency 
response over the important band and are very difficult to identify accurately. It 
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Clary's method 


is for this reason that known s-plane zeros at infinity can be assigned to z — —1 
and used to reduce the uncertainty in the discrete transfer-function estimation. 
To illustrate Clary's procedure, suppose we model the known poles as the 
roots of the polynomial a, (z) and model the known zeros as the roots of 5, (z). 
Then the overall transfer function can be factored into known and unknown parts 
in the form 
b. (z) 
a,(z) 


In block-diagram form, the situation can be arranged as shown in Fig. 12.2. In 
this case, once the simpler unknown G, (z) is estimated from the filtered data u 
and y " the overall transfer function G(z) can be reconstructed from Eq. (12.27). 


G(z) — G,(z). (12.27) 


€ Example 12.3 


Figure 12.2 

Block diagram of 
identification of a 
partially known transfer 
function 


Using Known Components 


Consider the system formed by multiplying the plant of Example 12.2 by 1/s?. Identify the 
corresponding discrete system by Clary's method. 


Solution. The resulting fourth-order discrete system has two poles at z = | and zeros at 
z = —9.5, z = —0.98, and z = —0.1. If thesamechirp signal used in Example 12.2 is used to 
identify this transfer function from its input and output, the results are shown in Fig. 12.3(a) 
and (b). Clearly the estimates, shown by x's on the plots, are very poor in both magnitude 
and phase. However, if u is filtered by (1 +27')? and y is filtered by (1— z y, then the 
transfer function G, can be estimated from these signals. Afterward, the estimated frequency 


12.2 


Figure 12.3 
Transfer function 
estimates for a 
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responses are multiplied by the known frequency responses according to Eq. (12.27) and the 
total frequency response is plotted in Fig. 12.3(c) and (d). Now an excellent estimate of the 
transfer function results. 

+ 


Our final example of the use of prior information in identification occurs 
when it is known what the important frequency range is and, furthermore, that 
significant sensor noise appears in the plant output outside this range. Such noise 
is almost always present in experimental data and typically includes both noise 
from physical sources and noise that can be attributed to the quantization of 
A/D conversion. In this case, we observe that the identification of G(z) is the 
same as the identification of F(z)G(z) F(z)~' because the filter transfer function 
F cancels. However, suppose that we filter both u and y through F. Although 
this has no influence on the relation between u and y, it can suppress the noise 
significantly. The equations that illustrate this effect are 


y = Gu +v, 
Fy = GFu + Fv, 
y= Gu; + Fv. (12.28) 


Therefore, we have succeeded in replacing the noise input v with the filtered 
noise Fv while keeping the same relation between the signal input and the 
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output as before. If the noise includes significant energy at frequencies that 
are not important to the signal response, then F can be shaped to remove these 
components and greatly improve the transfer-function estimates. Most commonly, 
F is a lowpass filter such as a Butterworth characteristic with a cutoff frequency 
just above the range of important frequencies for the plant model. In the case 
of nonparametric estimation, the effect of the lowpass filter can be achieved 
simply by ignoring the estimates outside the desired range, as could be seen from 
Fig. 12.1. However, when we come to parametric estimation, the use of such 
filters will be found to be essential. 


Stochastic Spectral Estimation 


The deterministic methods just described require a high signal-to-noise ratio in 
the important band of the transfer function to be estimated. If this is not available, 
as frequently occurs, an alternative based on averaging or statistical estimation is 
available. The necessary equations are easily derived from the results on stochastic 
processes given in Appendix D. Suppose we describe the plant dynamics with 
the convolution 

oo 


y(n) = D> g(kyu(n — k) (12.29) 


k=—00 


and assume that u and therefore y are zero mean stochastic processes. If we 
multiply Eq. (12.29) on both sides by u(n — £) and take expected values, we 
obtain 


£y(n)u(n — € 2€ D> güQu(n — k)yu(n — 0), 


kz—oo 
00 


RO = D> gk)R,,(€—), (12.30) 


yu uu 
(——o00 


where R „(£) is the cross-correlation between the y process and the u process 
and R, (£ — k) is the autocorrelation of the u process. If we take the z-transform 


uu 


of Eq. (12.30), it is clear, because this is a convolution, that 


S,, (2) = G()S,, (z) 


yu ua 
and thus that 


S) 
= 12.31 
G(z) s o ( ) 


where a (z), called the cross-power spectrum between the y and the u processes, 
is the z-transform of Ry, (£) and S, (z), the z-transform of R,, (£), is called the 


uu 


power spectrum of the u process. 
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With these equations, the estimation of the transfer function G(z) has been 
converted to the estimation of the correlation functions R s and R pu or, equiva- 
lently, to the estimation of the spectra S,, and S,, from the data records of y(k) 
and u(k). This topic, spectral estimation, is covered by a vast literature going back 
to the 1930’s at least, and we will indicate only some of the important results 
relevant to the identification problem here. In the first place, our estimation of 
the correlation functions depends upon the ergodic assumption that the stochastic 
expectations can be computed as time averages according to ° 


N 


Y y(n)u(n - 0. (12.32) 


nz-N 


RO = 1H ON 31 


In order to use Eq. (12.32) with experimental data, we must consider that the data 
are finite. Suppose we have N points of input and N points of output data. Then 
we can write 

3 lx 

Ry 73 y(n)u(n — £), (12.33) 
where it is understood that y(k) and u(k) are zero outside the range 0 < k < 
N — 1. The number of data values used to compose R,, (£) in Eq. (12.33) is N — £. 
Clearly, as £ gets near to N, this number gets small and very little averaging is 
involved. We conclude that the estimate of R from Eq. (12.33) can be expected to 
be very poor for |£| near N, and we normally stop the sum for |£| > L/2, where 
we might take L = N/8, for example. The resulting estimate is a complicated 
function of the exact number of points used, and typically, in identification, the 
number to be used is determined by experience and experimentation. Another 
feature of Eq. (12.33) is the fact that in taking a finite number of points in the 
sum, it is as if we had multiplied the result, for any given £, by a window of 
finite duration. The transform reflection of such a product is to convolve the 
true spectrum (transform) with the transform of the window, and the rectangular 
window obtained by simply truncating the series is particularly poor for this 
purpose. Rather, another window is normally introduced that has better spectral 
smoothing properties, such as the Hamming or the Kaiser window. To illustrate 
some of the properties of windows, a Hamming window of length 64 and its 
magnitude spectrum are plotted in Fig. 12.4. It should be recalled that convolution 
of a function with an impulse returns exactly the same function. Convolution with 
a bandpass filter of finite width has the effect of averaging the function over the 
width of the pass band. Fig. 12.4 shows that the Hamming window of length 64 
has a spectrum with a main lobe of width about 8/128, or 6.2546 of the Nyquist 
frequency. A longer window would have a more narrow pass band, and a shorter 
window would have a wider spectral lobe and consequently more averaging of 
the spectrum in computing the estimate by Eq. (12.33). The engineer must use 


2 The MATLAB function xcorr computes R,, (—€) = R,,(£. 
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Figure 12.4 

Hamming window: 

(a) weighting function, 
and (b) spectrum 
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judgement to select a window function length that will balance noise reduction, 
for which a short window is wanted, against average spectrum accuracy, for which 
a long window is wanted. 

From these considerations, a practical formula for spectrum estimation is 
derived as 


L/2 


$ (= on w()R. G)z"*. (12.34) 


yu yu 
t=-L/2 


€ Example 12.4 


Stochastic Estimate of a Transfer Function 


Apply a random input signal to the second-order system of Example 12.2 and estimate the 
transfer function using Eq. (12.31) with the estimates according to Eq. (12.34). Repeat, with 
20% random noise added to the output. 


Solution. The system was excited with a 256-point random input of zero mean, and the 
autocorrelation and the cross-correlation were computed for all 512 points for the output with 
no noise and for the output with 20% noise. That is, the input to the system had a root-mean- 
square value of one, and the output had a noise added whose root-mean-square value was 
0.2. A Hamming window of width 256 was used and the spectra computed for the system 


Figure 12.5 

Estimation of transfer 
function with random 
input using spectral 
estimation: (a) and (b) no 
noise, (c) and (d) 20% 
noise 
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with no noise and for the system with output noise. The resulting transfer functions computed 
according to Eq. (12.31) are plotted in Fig. 12.5. 


12.3 


+ 


As can be seen from parts (a) and (b) of Fig. 12.5, the estimate when there is 
no noise is excellent. However, with the noise, as seen in parts (c) and (d), there 
is serious deviation of the estimate from the true transfer function, especially at 
the higher frequencies. In a real situation, serious consideration would be given 
to using a shorter window, perhaps as short as 100 or even less. As the window is 
shortened, however, the details of the resonant peak, as discussed, will be smeared 
out. The engineer’s judgement and knowledge of the system under investigation 
must be used to balance the random fluctuations against the systematic errors. 


Models and Criteria for Parametric 
Identification 


Because severa] design techniques including design by root locus and design 
by pole placement require a parametric model, it is important to understand how 
such models can be constructed. This is particularly important for adaptive control 
where the design must be implemented on-line and either the plant parameters 
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12.3.1 


or the controller parameters are to be estimated. To formulate a problem for 
parametric identification, it is necessary to select the parameters that are to be 
estimated and a criterion by which they are to be evaluated. 


Parameter Selection 


For example, suppose we have a transfer function that is the ratio of two polyno- 
mials in z. We can select the coefficients of these polynomials as the parameters. 
For the third-order case 


bz +b,z +b, 
z +a, +a, +a, 
and the parameter vector 0 is taken as 
O= (a, a, a b, b, blo. (12.36) 


We imagine that we observe a set of input sample values [u(k)] and a set of 
corresponding output sample values [y(k)], and that these come from a plant that 
can be described by the transfer function Eq. (12.35) for some “true” value of 
the parameters, 0°. Our task, the parametric identification problem, is to compute 
from these [u(k)] and [y(k)] an estimate 9 that is a “good” approximation to 9^? 

To repeat the formulations from a different point of view, suppose we postu- 
late a (discrete) state- variable description of the plant and take the parameters to 
be the elements of the matrices $, F, H. We assume J = 0. Then we have 


x(k + 1) = $(0,)x(k) + F'(0,)u (k), 
y(k) = H(8))x(&). (12.37) 


For the third-order model, there are nine elements in P and three elements each 
in I and H, for a total of fifteen parameters in Eq. (12.37), where six were enough 
in Eq. (12.35 ) for an equivalent description. If the six-element 0 in Eq. ( 12.36) 
and the fifteen-element 0, in Eq. (12.37 ) describe the same transfer function, 
then we say they are equivalent parameters. Any set of u(k) and y(k) generated 
by one can be generated by the other, so, as far as control of y based on input 
u is concerned, there is no difference between 6 and 0, even though they have 
very different elements. This means, of course, that the state-variable description 
has nine parameters that are in some sense redundant and can be chosen rather 
arbitrarily. In fact, we have already seen in Chapter 4 that the definition of the 
state is not unique and that if we were to change the state in Eq. (12.37) by the 
substitution ë = Tx, we would have the same transfer function G (2). It is exactly 
the nine elements of T which represent the excess parameters in Eq. (12.37); 
we should select these in a way that makes our task as easy as possible. The 
standard, even obvious, way to do this is to define our state so that «b, T, H arein 


G(z,0) — (12.35) 


3 For the moment, we assume that we know the number of states required to describe the plant. Estimation 
of n will be considered later. 


Figure 12.6 


12.3 Models and Criteria for Parametric Identification 497 


accordance with one of the canonical forms of Chapter 2 for transfer functions. 
For example, in observer canonical form we would have 


=a, 1 0 b, 
p= | -a, 0 ! r=] b, |, HJ. 0 €], (12.38) 
~a, 0 0 b, 


and we see immediately that these matrices are just functions of the 0 of 
Eq. (12.36), and the equivalence of 0, to 0 is obvious. 

But now let us repeat, in this context, the comments made earlier about the 
difference between identification for control and modelling for science. In taking 
the canonical form represented by Eq. (12.38), we have almost surely scrambled 
into the a; and b, a jumble of the physical parameters of masses, lengths, spring 
constants, coefficients of friction, and so on. Although the physical nature of the 
problem can be best described and understood in terms of these numbers, it is the 
a, and b, that best serve our purpose of control, and it is with these that we will 
be concerned here. 

Before leaving the matter of canonical forms, it is appropriate at this point to 
introduce another form called, in digital signal processing, the ARMA* model; 
this model has one feature especially appropriate for identification. A block 
diagram of the ARMA realization of G(z) is shown in Fig. 12.6. Elementary 
calculations left to the reader will show that the transfer function of this block 


Block diagram of the ARMA realization of G(z) 


4 ARMA, an acronym for AutoRegressive Moving Average, comes from study of random processes 
generated by white noise filtered through the transfer function G(z) of Eq. (12.35). We will have more 
to say about this after the element of randomness is introduced. 
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parameters of the ARMA 
model 


12.3.2 


diagram is given by Eq. (12.35). The state equations are more interesting. Here 
we find six states and the matrices 


=a ud. =a, bi hy bs 0 

p O QU o 0 9 0 

0 L © OIL 0 
216 6 9 06 oI P=) 

0 0 2 1 0 D 0 

D w x 33.3] 0 
H-[-a, -a, -a, b, b, b, ]. (12.39) 


From the point of view of state-space analysis, the system described by 
Eq. (12.39) is seen to have six states to describe a third-order transfer func- 
tion and thus to be “nonminimal.” In fact, the ® matrix can be shown to have 
three poles at z = 0 that are not observable for any values of a, or b;. However, 
the system does have one remarkable property: The state is given by 


x(k) 2[y(k 1) y(k—2) y(k—-3) u(k—1) u(k—2) u(k—3). 
(12.40) 
In other words, the state is exactly given by the past inputs and outputs so that, if 
we have the set of (u(k)] and [y(k)], we have the state also, since it is merely a 
listing of six members of the set. 
AII the action, as it were, takes place in the output equation, which is 


y(k) = Hx(k) 
= —a,y(k — 1) — a,y(k — 2) — ay (k — 3) 
+b,u(k — 1) + b,u(k — 2) + bu(k — 3). (12.41) 


There is no need to carry any other equation along because the state equations are 
trivially related to this output equation. We will use the ARMA model in some 
of our later formulations for identification. 

Thus we conclude that within the class of discrete parametric models we 
wish to select a model that has the fewest number of parameters and yet will 
be equivalent to the assumed plant description. A model whose parameters are 
uniquely determined by the observed data is highly desirable; a model that will 
make subsequent control design simple is also often selected. 


Error Definition 


Having selected the class of models described by our assumed plant description, 
we now turn to the techniques for selecting the particular estimate, 0, that best 
represents the given data. For this we require some idea of goodness of fit of a 
proposed value of 0 to the true 8°. Because, by the very nature of the problem, 8° 
is unknown, it is unrealistic to define a direct parameter error between 0 and 0". 
We must define the error in a way that can be computed from [u(k)] and [y(k)]. 


equation error 
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Three definitions that have been proposed and studied extensively are equation 
error, output error, and prediction error. 


Equation Error 


For the equation error, we need complete equations of motion as given, for 
example, by a state-variable description. To be just a bit general for the moment, 
suppose we have a nonlinear continuous time description with parameter vector 
0, which we can write as 


x — f(x, u; 0). 


We assume, first, that we know the form of the vector functions f but not the 
particular parameters 0° that describe the plant. We assume, second, that we are 
able to measure not only the controls u but also the state x and the state derivative 
x. We thus know everything about the equations but the particular parameter 
values. We can, therefore, form an error comprised of the extent to which these 
equations of motion fail to be true for a specific value of 0 when used with the 
specific actual data x, , x,, and u,. We write 

> — if 


a? 


u,; 0) = e(t; 0), 


and we assume that there is a vector of true parameters 8° such that e(t; 6°) = 0. 
The vector e(t; 0) is defined as the equation error. The idea is that we would 
form some non-negative function of the error such as 


T 
J (0) =| e' (t, 0) e(t, 0) dt, (12.42) 
0 


and search over 0 until we find Ô such that J (0) is a minimum, at which time 
we will have a parameter set 0 that is an estimate of 0° . If we had an exact set 
of equations and selected a unique parameterization, then only one parameter 
set will make e(t; 0^) = 0, and so we will have 0 = 0". If noise is present in 
the equations of motion, then the error will not be zero at 0 = 05, but J will be 
minimized at that point. 

The assumption that we have enough sensors to measure the total state and 
all state derivatives is a strong assumption and often not realistic in continuous 
model identification. However, in discrete linear models there is one case where 
it is immediate, and that is the case of an ARMA model. The reason for this is 
not hard to find: In an ARMA model the state is no more than recent values of 


5 Methods to search for the minimizing Ó are the subject of nonlinear programming, which is discusscd in 
Luenberger (1973). We give a brief discussion later in this chapter connected with maximum likelihood 
estimates. 
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input and output! To be explicit about it, let us write the linear, discrete model 
equation error, which is 


X (k + 1) — dx (k) — Tu, (k) = e(k; 8), (12.43) 


where ® and I are functions of the parameters 0. Now let us substitute the values 
from Eq. (12.39), the ARMA model 


x,(k + 1) m —U =a, ph D x, (k) 
x(k +1) 1 0 0o 0 0 0 x(k) 
xwerbD]] 90.1 8 od E E 
x, +1) 0 0 0 00 0 |] x(k) 
x, +1) D. «0- ea Ya ORO 9] si 09) 
x(k +1) @ of «wo 4 oJ 

0 e,(k) 

0 e,(k) 

= : uio ps (12.44) 
4 
0 e,(k) 
0 e,(k) 


When we make the substitution from Eq. (12.40) we find that, for any 0 (which 
is to say, for any values of a, and b,) the elements of equation error are all zero 
except e,, and this element of error is given by 


x(k+1) + a x(k) +a, x,(k) + a, x,(k) 
bur (E) — b, x(k) — b, x(k) = e (k; 0) 
or 
y, 0) +a, y,(k — 1) +a, y,(k — 2) + a, y, (k — 3) 
— b u,(k — 1) bu tk — 2) — b, u,(k — 3) = e (k; 0). (12.45) 


The performance measure Eq. (12.42) becomes 


N 
J(8) = 3 ek; 0). (12.46) 
k=0 
Again, we place the subscript a on the observed data to emphasize the fact that 
these are actual data that were produced via the plant with (we assume) parameter 
values 0° = [aj a; a} by b5 b3], and in Eq. (12.45) and Eq. (12.46) an error results 
because 0 differs from 0^. 


Output Error 


As we have seen, the general case of equation error requires measurement of 
all elements of state and state derivatives. In order to avoid these assumptions a 
criterion based on output error can be used. This error is formulated in Fig. 12.7. 


Figure 12.7 

Block diagram showing 
the formulation of 
output error 


output error 
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Here we see that no attempt is made to measure the entire state of the plant, but 
rather the estimated parameter 0 is used in a model to produce the model output 
y, (A), which is a function, therefore, of 0 in the same way that the actual output 
is a function of the true parameter value 0". In the scalar output case the output 
error e, will be a scalar and we can form the criterion function 


N 
J(89 = 3:10) (1247) 
k=0 


and search for that Ô which makes T (6) as small as possible. To illustrate the 
difference between output error and equation error in one case, consider again 
the ARMA model for which we have 


Yn (k) = —a, Ym (k — 1) — a, y, (k — 2) — a, y,,(k — 3) 
+ b uj(k — 1) +b, u,(k — 2) +b, u,(k — 3) 


and then the output error 


e,(k; 0) = y,(k) — y, (k) 
= y, (k) +a, yo 0e 1) +a, Y„(k — 2) 0: Yn (k — 3) 
—b u, (k — 1) — b, u (k — 2) — b, u,(k — 3). (12.48) 


If we compare Eq. (12.48) with Eq. (12.45) we see that the equation error 
formulation has past values of the actual output, and the output error formulation 
uses past values of the model output. Presumably the equation error is in some 
way better because it takes more knowledge into account, but we are not well 
situated to study the matter at this point. 


Prediction Error 


The third approach to developing an error signal by which a parameter search 
can be structured is the prediction error. When we discussed observers in Chapter 
8 we made the point that a simple model is not a good basis for an observer 
because the equations of motion of the errors cannot be controlled. When we add 
random effects into our data-collection scheme, as we are about to do, much the 
same complaint can be raised about the generation of output errors. Instead of 
working with a simple model, with output y, (X), we are led to consider an output 
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prediction error 


12.4 


Figure 12.8 

Block diagram showing 
the generation of 
prediction error 


predictor that, within the confines of known structures, will do the best possible 
job of predicting y(k) based on previous observations. We will see that in one 
special circumstance the ARMA model error as given by Eq. (12.45) has the least 
prediction error, but this is not true in general. Following our development of the 
observer, we might suspect that a corrected model would be appropriate, but we 
must delay any results in that area until further developments. For the moment, 
and for the purpose of displaying the output prediction error formulation, we 
have the situation pictured in Fig. 12.8. 


Deterministic Estimation 


Having defined an error that depends on 0, we can formulate a performance cri- 
terion and search for that @ which is the best estimate of 0? within the context of 
the defined performance. To illustrate this use of error we have already defined a 
J (0) in Eq. (12.42), Eq. (12.46), and Eq. ( 12.47), in each case a sum of squares. 
The choice of a performance criterion is guided both by the computational diffi- 
culties it imposes in the effort to obtain 6 and by the properties of the estimate 
that results, that is, by how hard 6 is to find and how good it is when found. 
Among the criteria most widely used are: 


1. least-squares estimate (LS), 
2. best linear unbiased estimate (BLUE), and 
3. maximum likelihood estimate (MLE) 


The last two criteria require a stochastic element to be introduced into the 
model, and our understanding of least squares is enhanced by a stochastic perspec- 
tive. However, before weintroduce random noise into the picture, it is informative 
to consider deterministic least squares. Thus we will discuss least squares first, 
then introduce random noise and discuss the formulation of random errors, and 
finally define and discuss the BLUE and the MLE, their calculation and their 
properties. 


Prediction 
8 


12.4.1 
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Least Squares 


To begin our discussion of least squares, we will take the ARMA model and 
equation error, which leads us to Eq. (12.45), repeated below for the nth-order 
case, where the subscript a is understood but omitted 


y(k) +a, y(k — 1) +a,y(k — 2) +--+» +a yk —n) 
—b,u(k 1) —---- bulk ~n) = e(k; 0). (12.49) 
We assume that we observe the set of outputs and inputs 
1-:30)4—Y (D. o YU WO), n (09. MUN] 
and wish to compute values for 
8—[a,---a, b,--- b. Y, 


which will best fit the observed data. Because y (k) depends on past data back to 
n periods earlier, the first error we can form is e(n; 8). Suppose we define the 
vector of errors by writing Eq. (12.49) over and over fork =n,n+1,...,N. 
The results would be 
y(n) = $” (0 + e(n; 8), 
yn t 1) 2 (n - 19 +e(n + 1; 9), 


y(N) = $” (N)0 + e(N; 9), (12.50) 
where we have used the fact that the state of the ARMA model isf 
p(k) = [-y(k — 1) — y(k 22) ---u(k 2 1) ---u(k =n)]’. 


To make the error even more compact, we introduce another level of matrix 
notation and define 


Y(N) =[y(n) «+» x(NY', 
DN) = [ó(n) 6n 1) --- $N)", 
€(N; 9) = [e(n) --- e(N))', 
81g, «9,5; es TS (12.51) 


Note that P(N) is a matrix with 2n columns and N — n + 1 rows. In terms of 
these, we can write the equation errors as 


Y = $0 + e(N; 0). (12.52) 
6 We use 6 for the state here rather than x as in Eq. (12.44) because we will soon be developing equations 


for the evolution of @(k), estimates of the parameters, which will be the state of our identification 
dynamic system. 
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Least squares is a prescription that one should take that value of 0 which 
makes the sum of the squares of the e(k) as small as possible. In terms of 
Eq. (12.50), we define 


N 
J(8) = >> e*(k; 6), (12.53) 
k=n 


and in terms of Eq. (12.52), this is 
J (8) = €'(N; 0)e(N; 9). (12.54) 


We want to find 6 
the property 


us» the least-squares estimate of 0^, which is that 0 having 


IÔ) = 7(9). (12.55) 


But 7 (0) is a quadratic function of the 2n parameters in 0, and from calculus we 
take the result that a necessary condition on 6,, is that the partial derivatives of 


J with respect to 0 at 0 = 6, . should be zero. This we do as follows 


I= ee 
= (Y — 0) (Y - $0) 
= YTY — 07 y — Y' oe + 0? $! oo; 


and applying the rules developed above for derivatives of scalars with respect to 
vectors! we obtain 


mn EJ = —2YT 6 420/07 o. (12.56) 


If we take the transpose of Eq. (12.56) and let 0 — 6.5: we must get zero; thus 
©'b6,, = D'Y. (12.57) 


These equations are called the normal equations of the problem, and their solution 
will provide us with the least-squares estimate 9, .. 

Do the equations have a unique solution? The answer depends mainly on 
how 0 was selected and what input signals {u(&)} were used. Recall that earlier 
we saw that a general third-order state model had fifteen parameters, but that only 
six of these were needed to completely describe the input-output dependency. 
If we stayed with the fifteen-element 0, the resulting normal equations could 
not have a unique solution. To obtain a unique parameter set, we must select a 
canonical form having a minimal number of parameters, such as the observer or 
ARMA forms. By way of definition, a parameter 0 having the property that one 


7 The reader who has not done so before should write out a" Qa for a 3 x 3 case and verify that 
da’ Qa/8a = 2a" Q. 
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and only one value of 0 makes ,7 (0) a minimum is said to be “identifiable.” Two 
parameters having the property that .7(0,) = .7(0,) are said to be “equivalent.” 

As to the selection of the inputs u(k), let us consider an absurd case. Suppose 
u(k) = c for all k — a step function input. Now suppose we look at Eq. (12.50) 
again for the third-order case to be specific. The errors are 


y(3) = —a,yQ) — a, y(1) — ay) + b,c + b,c + b,c + €(3), 


(12.58) 


y(N) = —a,y(N,) — a(N — 2) —a,(N — 3) + b,c + b,c + b,c + e(N). 
It is obvious that in Eq. (12.58) the parameters b, b,, and b, always appear as 
the sum b, + b, + b, and that separation of them is not possible when a constant 
input is used. Somehow the constant u fails to “excite” all the dynamics of the 
plant. This problem has been studied extensively, and the property of "persis- 
tently exciting" has been defined to describe a sequence {u(k)} that fluctuates 
enough to avoid the possibility that only linear combinations of elements of 
0 will show up in the error and hence in the normal equations [Ljung(1987)]. 
Without being more specific at this point, we can say that an input is persis- 
tently exciting of order n if the lower right (n x n)-matrix component of ®’® 
[which depends only on (u(k))] is nonsingular. It can be shown that a sig- 
nal is persistently exciting of order n if its discrete spectrum has at least n 
nonzero points over the range 0 < wT < x. White noise and a pseudo-random 
binary signal are examples of frequently used persistently exciting input sig- 
nals. 
For the moment, then, we will assume that the u(k) are persistently exciting 
and that the 0 are identifiable and consequently that ®’ ® is nonsingular. We can 
then write the explicit solution 


6, = ($D) 9v. (12.59) 


It should be especially noted that although we are here mainly interested in iden- 
tification of parameters to describe dynamic systems, the solution Eq. ( 12.59) 
derives entirely from the error equations Eq. (12.52) and the sum of squares cri- 
terion Eq. (12.54). Least squares is used for all manner of curve fitting, including 
nonlinear least squares when the error is a nonlinear function of the parameters. 
Numerical methods for solving for the least-squares solution to Eq. (12.57) with- 
out ever explicitly forming the product ®’ ® have been extensively studied [see 
Golub (1965) and Strang (1976)]. 

The performance measure Eq. (12.53) is essentially based on the view that 
all the errors are equally important. This is not necessarily so, and a very simple 
modification can take account of known differences in the errors. We might 
know, for example, that data taken later in the experiment were much more in 
error than data taken early on, and it would seem reasonable to weight the errors 
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accordingly. Such a scheme is referred to as weighted least squares and is based 
on the performance criterion 


N 
I (0) = 5 w(kye*(k; 0) = e" We. (12.60) 
k=n 
In Eq. (12.60) we take the weighting function w(k) to be positive and, presumably, 
to be small where the errors are expected to be large, and vice versa. In any event, 
derivation of the normal equations from Eq. (12.60) follows at once and gives 


7 A : 
P W$9,.— o'’wy, 
and, subject to the coefficient matrix being nonsingular, we have 
és = (P WH) OWY. (12.61) 


We note that Eq. (12.61) reduces to ordinary least squares when W = I, the 
identity matrix. Another common choice for w(k) in Eq. ( 12.60) is w(k) = 
(1 — y)y" * for y < 1. This choice weights the recent (k near N) observations 
more than the past (k near n) ones and corresponds to a first-order filter operating 
on the squared error. The factor 1 — y causes the gain of the equivalent filter to 
be | for constant errors. As y nears 1, the filter memory becomes long, and noise 
effects are reduced; whereas for smaller y, the memory is short, and the estimate 
can track changes that can occur in 0 if the computation is done over and over as 
N increases. The past is weighted geometrically with weighting y, but a rough 
estimate of the memory length is given by 1/(1 — y); so, for example, a y = 0.99 
corresponds to a memory of about 100 samples. The choice is a compromise 
between a short memory which permits tracking changing parameters and a long 
memory which incorporates a lot of averaging and reduces the noise variance. 


Recursive Least Squares 


The weighted least-squares calculation for 6b wis given in Eq. (12.61) is referred 
to as a “batch” calculation, because by the definition of the several entries, the 
formula presumes that one has a batch of data of length N from which the 
matrices Y and P are composed according to the definitions in Eq. (12.51 ), and 
from which, with the addition of the weighting matrix W, the normal equations 
are solved. There are times when the data are acquired sequentially rather than in 
a batch, and other times when one wishes to examine the nature of the solution 
as more data are included to see whether perhaps some improvement in the 
parameter estimates continues to be made or whether any surprises occur such 
as a sudden change in 0 or a persistent drift in one or more of the parameters. In 
short, one wishes sometimes to do a visual or experimental examination of the 
new estimates as one or several more data points are included in the computed 
values of 6, .. 


12.4 Deterministic Estimation 507 


The equations of Eq. (12.61) can be put into a form for sequential processing 
of the type described. We begin with Eq. (12.61) as solved for N data points and 
consider the consequences of taking one more observation. We need to consider 
the structure of ©’ W« and ®’ WY as one more datum is added. Consider first 
PWO. To be specific about the weights, we will assume w = ay "-*. Then, 
if a = | and y = 1, we have ordinary least squares; and if a = 1 — y, we have 
exponentially weighted least squares. From Eq. (12.51) we have, for data up to 
time N + 1 


7 = [b(n)--- O(N) O(N + 1)] 


and 
NI NI 
$^ WH = 3 6(0w( (k) = 9 day" e (0, 
k=n kzn 


which can be written in two terms as 


N 
DW = X` é()ayy "9 (o) + O(N + Dae (N 1) 


k=n 


=y (N)YW(N)®(N) + (N + LDab™(N +1). (12.62) 


From the solution Eq. (12.61) we see that the inverse of the matrix in Eq. (12.62) 
will be required,® and for convenience and by convention we define the 2n x 2n 
matrix P as 


P(N +1) 2 [DN + DDW(N + 1)". (12.63) 
Then we see that Eq. (12.62) can be written as 
P(N +1) = [YP (N) + O(N + Dao! (N +1), (12.64) 


and we need the inverse of a sum of two matrices. This is a well-known problem, 
and a formula attributed to Householder (1964) known as the matrix inversion 
lemma is 


(A +BCD)"' = A^ - A'B(C'+DA'B)'DA”’. (12.65) 
To apply Eq. (12.65) to Eq. (12.62) we make the associations 
A= yP^'(N), 
B=O(N+1)=9, 
C=w(N +1) =a, 
D-4'(N41)24!, 


8 We assume for this discussion the existence of the inverse which assumes inputs that are persistently 
exciting and parameters that are identifiable. 
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and we find at once that 


N N 1 P(N) V! P(N 
pv +1) = EOD LP (Lear Ce) gi 269 
Y a y Y 
In the solution we also need ®’ WY, which we write as 
oe D y(n) 
DWY = [60) --6(N). O(N +1)] k 
ay y(N) 
O a y(N +1) 
(12.67) 


which can be expressed in two terms as 
$^WY(N +1) 2 y 9 WY(N) + O(N +1)ay(N + 1). (12.68) 


If we now substitute the expression for P(N 4- 1) from Eq. (12.66 ) and for 
®WY(N +1) from Eq. (12.68) into Eq. (12.61), we find [letting P(N) = 
P, $(N + 1) = 6, and y(N + 1) = y for notational convenience] 


A P P l p X! wp 
é (N+1)= E --6 E t9 - J eT [VÐ WY (N) + hay]. 
yY Y a Y Y 


(12.69) 
When we multiply the factors in Eq. (12.69), we see that the term P®’WY(N) = 
0, s (V), so that Eq. (12.69) reduces to 


^ 


" T P P l p x 
Ows N +D) = Owis(N) F y day — E o (G + Pa J 676... 


l 
a 


E e(l e) quis (12.70) 
y M Y 


If we now insert the identity 


(- gno 2] (< pte. J 
a Y a y 


between the ¢ and the a in the second term on the right of Eq. ( 12.70), we can 
combine the two terms which multiply y to reduce Eq. (12.70) to 


Ôw (N+ 1) = 9,0) HLN + DO(N D — 7 8,,.(N)), — (02710) 
where we have defined 
7 -1 
uv eos Fe(1 278) ; (12.72) 
a a X 


Equations (12.66), (12.71), and (12.72) can be combined into a set of steps that 
constitute an algorithm for computing 0 recursively. To collect these, we proceed 
as follows: 


, 
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- 


. Select a y, and N. 


N 


. Comment: a = y = 1 is ordinary least squares; a = 1 — y and0 < y <1 
is exponentially weighted least squares. 

. Select initial values for P(N) and 6(N ). Comment: See discussion below. 

. Collect y(0),..., y(N) and u(0),..., u(N) and form $! (N +1). 

. Letk € N. 


3 
4 
5 
(k) m 
6. ETE] D etep(z toat k+) 
7 
8 
9 


. Collect y(k + Dand u(k +1). 
. Ok +1) — 6) + LK + DO (k + 1) — $” (k + 108(&)) 


. P(k +1) | -ü — L(k + 16 (k + 1)]P(k) 


10. Form b(k + 2). 
11. Letk < k +1. 
12. Go to Step 6. 


Especially pleasing is the form of Step 8, the "update" formula for the next 
value of the estimate. We see that the term ^ 6(N ) is the output to be expected at 
the time N + 1 based on the previous data, 6(N + 1), and the previous estimate, 
6(N). Thus the next estimate of @ is given by the old estimate corrected by a 
term linear in the error between the observed output, y(N + 1), and the predicted 
output, 7 6(N). The gain of the correction, L(N + 1), is given by Eq. (12.71) 
and Eq. (12.72). Note especially that in Eq. (12.71) no matrix inversion is required 
but only division by the scalar 


ique. 
a y 


However, one should not take the implication that Eq. (12.71) is without numerical 
difficulties, but their study is beyond the scope of this text. 

We still have the question of initial conditions to resolve. Two possibilities 
are commonly recommended: 


1. Collectabatchof N > 2n data values and solve the batch formula Eq. (12.61) 
once for P(N), L(N + 1), and O(N), and enter these values at Step 3. 
2. Set0(N) = 0, P(N) = al, whereo is a large scalar. The suggestion has been 


made that an estimate of a suitable œ is [Soderstrom, Ljung, and Gustavsson 
(1974)] 


a= (1) yy (i). 
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The steps in the table update the least-squares estimate of the parameters 0 
when one more pair of data points 4 and y are taken. With only modest effort 
we can extend these formulas to include the case of vector or multivariable 
observations wherein the data y(k) are a vector of p simultaneous observations. 
We assume that parameters 0 have been defined such that the system can be 
described by 


y(k) = b” ()0 + elk; 0). (12.73) 


One such set of parameters is defined by the multivariable ARMA model 
yk) 2 - ayk - i + Y But i), (12.74) 
i=l i=l 


where the a; are scalars, the B, are p x m matrices, 0 is (n +nmp) x 1, and 
the (k) are now (n +nmp) x p matrices. If we define ®, Y, and e as in 
Eq. (12.51), the remainder of the batch formula development proceeds exactly 
as before, leading to Eq. (12.59) for the least-squares estimates and Eq. (12.61) 
for the weighted least-squares estimates with little more than a change in the 
definition of the elements in the equations. We need to modify Eq. (12.60) to 
J = Ze” we, reflecting the fact that €(k) is now also a p x 1 vector and the w(k) 
are p x p nonsingular matrices. 

To compute the recursive estimate equations, we need to repeat Eq. (12.62) 
and the development following Eq. (12.62) with the new definitions. Only minor 
changes are required. For example, in Eq. (12.66) we must replace 1/a by a7’. 
The resulting equations are, in the format of Eq. (12.71) 


-1 
LN +=" e(a eee) (a) 


P(N +1) = -d- LW + 1)b")P, (b) 


GO + D = Oy, HLN + DIy(N + 1) — 676, (N) (c) (0275) 


Stochastic Least Squares 


Thus far we have presented the least-squares method with no comment about the 
possibility that the data might in fact be subject to random effects. Because such 
effects are very common and often are the best available vehicle for describing 
the differences between an ideal model and real-plant signal observations, it is 
essential that some account of such effects be included in our calculations. We 
will begin with an analysis of the most elementary of cases and add realism (and 
difficulties) as we go along. Appendix D provides a brief catalog of results we 
will need from probability, statistics, and stochastic processes. 
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As a Start, we consider the case of a deterministic model with random errors 
in the data. We consider the equations that generate the data to be? 


y(k) = a70? + v(k), (12.76) 
which, in matrix notation, becomes 
Y — A0? 4 V. CUZ) 


The v(k) in Eq. (12.76) are assumed to be random variables with zero mean 
(one can always subtract a known mean) and known covariance. In particular, we 
assume that the actual data are generated from Eq. (12.77) with 0 = 0? and 


Ev(k) = 0, 
"eun Jr) 
€vwv(k)v(]) =a ô = d? iiy) (12.78) 
or 
EVV’ =o’. 


As an example of the type considered here, suppose we have a physics 
experiment in which observations are made of the positions of a mass that moves 
without forces but with unknown initial position and velocity. We assume then 
that the motion will be a line, which may be written as 


y(t) =a? + b?t +n(t). (12.79) 


If we take observations at times liti stus then 


T 
Y=[y@).--yty), A= ( ° i ^ w ) l 
the unknown parameters are 
@=[a b], 
and the noise is 
v(k) =n(t,). 


We assume that the observations are without systematic bias—that is, the noise 

has zero average value—and that we can estimate the noise “intensity,” or mean- 

square value, o*. We assume zero error in the clock times, t,, SO A is known. 
The (stochastic) least-squares problem is to find Ô which will minimize 


4(0) = (Y — A0)'(Y — A0) 


-Y e 9. (12.80) 
k=0 


9 There is no connection between the a in (12.76) and the a used as part of the weights in Section 12.4. 
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consistent estimate 


Note that the errors, e(k; 0), depend both on the random noise and on the choice 
of 0, and e(k; 99) = v(k). Now the solution will be a random variable because 
the data, Y, on which it is based, is random. However, for specific actual data, 
Eq. (12.80) represents the same quadratic function of 0 we have seen before; 
and the same form of the solution results, save only the substitution of A for ®, 
namely 


e (ATA)"A"Y. (12.81) 


Now, however, we should not expect to find zero for the sum of the errors given 
by Eq. (12.80), even if we should determine 6-0 exactly (which of course we 
won't). Because of the random effects, then, we must generalize our concepts 
of what constitutes a “good” estimate. We will use three features of a stochastic 
estimate. The first of these is BORSE 

An estimate 6 of a parameter @° is said to be consistent if, in the long run, 
the difference between 6 and 0° becomes negligible. In statistics and probability, 
there are several formal ways by which one can define a negligible difference. 
For our purposes we will use the mean-square criterion, by which we measure 
the difference between 6 and 9? by the sum of the squares of the parameter error, 
6- 0°. We make explicit the dependence of 6 on the length of the data by writing 
ÔN) and say that an estimate is consistent if '? 


Jim £(6(N) — 9)'(6(N) — 0°) 20 
Jim tr £(6(N) — 0°)(6(N) — 09)" 2 0. (12.82) 


If Eq. (12.82) is true, we say that 6(N ) converges to 0? in the mean- square sense 
as N approaches infinity. The expression in Eq. (12.82) can be made more explicit 
in the case of the least-squares estimate of Eq. (12.81). We have 

6,, — 0° (ATA)'AT (A0 + V) — 0° 
€ + (A7A)'ATV — 6° 
(ATA)! ATV 


and 
£(6,, — 0°)(6,, — 0°)” = £(A7A)"'ATVV' A(ATA)-] 
= (ATA)! ATEVV' A(ATA)^! 
= o*(A'" A). (12.83) 


In making the reductions shown in the development of Eq. (12.83) we have 
used the facts that A is a known matrix (not random) and that EVV? = c?I. 


10 The trace (tr) of a matrix A with elements a; is the sum of the diagonal elements, or trace A = Ya 
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Continuing then with consideration of the least-squares estimate, we see that M 
is consistent if 


Jim tra? (A A)! — 0. (12.84) 
As an example of consistency, we consider the most simple of problems, the 


observation of a constant. Perhaps we have the mass of the earlier problem, but 
with zero velocity. Thus 


y(k) =a°+v(k), k=0,1,...,N, (12.85) 
and 
A= 1...17, © =a, 
N N 
A'A- Y eNi ATY=)_ yh); 
k=0 k=0 
then 
2 N 
ôs (N) = (N + 7! > y(0. (12.86) 
k=0 


which is the sample average. 
Now, if we apply Eq. (12.84), we find 
2 


; 2 -i P c NN 
lim tro (N + 1) SANG a] zu 
and we conclude that this estimate Eq. (12.86) is a consistent estimate of a°. If 
we keep taking observations according to Eq. (12.85) and calculating the sum 
according to Eq. (12.86), we will eventually have a value that differs from a? by 
a negligible amount in the mean-square sense. 

The estimate given by Eq. (12.86) is a batch calculation. It can be informative 
to apply the recursive algorithm of Eq. (12.52) to this trivial case just to see how 
the equations will look. Suppose we agree to start the equations with one stage 
of "batch" on, say, two observations, y(0) and y(1). We thus have a — y — 1, 
for least squares, and 


(12.87) 


Puy = ATAJ A 

$(N) = 1, m » 

ôa) = i90) +x), initial conditions (12.88) 
Wi Th: 


The iteration in P is given by (because  z 1) 
P(N)O)Q) P(N) 


REDE e ES PONG) 


P(N) 
1+ P(N) 
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Thus the entire iteration is given by the initial conditions of Eq. (12.88) plus: 


1. LetN=1 
2. P(N+- 1) = SR n (12.89) 
1+ P(N) 
3. L(N4+ 1) = dab, 
~ L+ P(N) 


4, O(N + 1) = O(N) + LIN + DON + 1) — 8(N)). 
. Let N be replaced by N + 1. 
6. Go to Step 2. 


n 


The reader can verify that in this simple case, P(N) = 1/(N + 1), and Step 4 
of the recursive equations gives the same (consistent) estimate as the batch 
formula Eq. (4.60). We also note that the matrix P is proportional to the variance 
in the error of the parameter estimate as expressed in Eq. (12.83). 

Although consistency is the first property one should expect of an estimate, 
it is, after all, an asymptotic property that describes a feature of 0 as N grows 
without bound. A second property that can be evaluated is that of “bias.” If 0 
is an estimate of 0, the bias in the estimate is the difference between the mean 
value of 8 and the true value, 0°. We have 


£6(N)- 0? —b (12.90) 


If b = 0 for all N, the estimate is said to be unbiased. The least-squares estimate 
given by Eq. (12.59) is unbiased, which we can prove by direct calculation as 
follows. If we return to the development of the mean-square error in 0,. given 
by Eq. (12.83) we find that 
6,, — 6? = (ATA)! AT V, (12.91) 


LS 


and the bias is 
EÔ, — 0° = £(ATA)"ATV 
= (ATA)! ATEV 
= 0. (12.92) 


Actually, we would really like to have 6(N) be "close" to 0° for finite values 
of N and would, as a third property, like to treat the mean-square parameter error 
for finite numbers of samples. Unfortunately, it is difficult to obtain an estimate 
that has a minimum for the square of @ — 0? without involving the value of 6° 
directly, which we do not know, else there is no point in estimating it. We can, 
however, find an estimate that is the best (in the sense of mean-square parameter 
error) estimate which is also linear in Y and unbiased. The result is called a Best 
Linear Unbiased Estimate, or BLUE. 
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The development proceeds as follows. Because the estimate is to be a linear 
function of the data, we write!’ 


ó-L.Y. (12.93) 
Because the estimate is to be unbiased, we require 
£6 = e, 
or 
ELY = £L(A0? + V) = 0°. 
Thus 
LAO® = 0°, 
or 
LA =I. (12.94) 


We wish to find L so that an estimate of the form Eq. (12.93), subject to the 
constraint Eq. (12.94), makes the mean-square error 


J(L) = tr £(0 — 0°)(6 — 0°)” (12.95) 
as small as possible. Using Eq. (12.93) in Eq. (12.95), we have 
J(L) = wE{(LY — 0°)(LY — 057), 
and using Eq. (12.77) for Y 
J(L) = tr£(LA0? + LV — 0°)(LA@° + LV — 8°)’. 
But from Eq. (12.94), this reduces to 


J(L) = w£(LV)(LV)" 
=trLRL’, (12.96) 


where we take the covariance for the noise to be 
EVV' =R. 


We now have the entirely deterministic problem of finding that L subject to 
Eq. (12.94) which makes Eq. (12.96) as small as possible. We solve the problem 
in an indirect way: We first find L when @° is a scalar and there is no trace 
operation (L is a single row). From this scalar solution we conjecture what 
the multiparameter solution might be and demonstrate that it is in fact correct. 
First, we consider the case when L is a row. We must introduce the constraint 


1i The matrix L used here for a linear dependence has no connection to the gain matrix used in the 
recursive estimate equation (12.72). 
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best linear unbiased 
estimate (BLUE) 


Eq. (12.94), and we do this by a Lagrange multiplier as in calculus and are led to 
find L such that 


J (L) = LRL" + A(A™L? -D (12.97) 


is a minimum. The A are the Lagrange multipliers. The necessary conditions on 
the elements of L are that 8.7/9L’ be zero. (We use L”, which is a column, 
to retain notation that is consistent with our earlier discussion of vector-matrix 
derivatives.) We have 


i5 "e 2LR + AA? — 0. 
Thus the best value for L is given by 
2LR = —AAT, 
L= -PAAURO. (12.98) 
Because the constraint Eq. (12.94) must be satisfied, L must be such that 
LA =1, 
or 
-SRATR'A -L 
From this we conclude that 
à = —2(ATR^! A)", (12.99) 
and substituting Eq. (12.99) back again in Eq. (12.98), we have, finally, that 
L = (A'R'A) A'R"!, (12.100) 
and the BLUE is 
6, = LY = (ATR^'A)-'A7R^' Y. (12.101) 


If we look at Eq. (12.61), we immediately see that this is exactly weighted least 
squares with W = R^!. What we have done, in effect, is give a reasonable—best 
linear unbiased-—criterion for selection of the weights. If, as we assumed earlier, 
R — c?], then the ordinary least-squares estimate is also the BLUE. 

But we have jumped ahead of ourselves; we have yet to show that Eq. (12.100) 
or Eq. (12.101) is true for a matrix L. Suppose we have another linear unbiased 
estimate, 0. We can write, without loss of generality 


6 = LY = (L+ Dy, 
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where L is given by Eq. (12.100). Because is required to be unbiased, Eq. (12.94) 
requires that 


LA-L @+Da=i1, LaA+LA=1, 
I+LA=I, LA=0. ueste) 


Now the mean-square error using @ is 
J(8) = r E(@ — 0°)(6 — 8°)’, 
which can be written as 
7(8) = trE@ — Ô + 6 — 0)(9 — 6 + 6 — 0°)” 
= tr(£(9 — 6)(6 — 6)" + 2£(0 — 6)(6 — 0°)” + £(0 — 8°) (6 — 057). 
(12.103) 
We note that the trace of a sum is the sum of the traces of the components, and 
the trace of the last term is, by definition, 7(L). Let us consider the second term 
of Eq. (12.103), namely 
term2 = tr£(9 — 6)(6 — 0°)” = tr€(LY — LY)(LY — 957 
= tr€(LY + LY - LY)(LY - 0°) 
= tr€(LY)(LY — 0°) 
= tr£(L(A9? + V))(L(A0? + V) — 0°)”. 


Now we use Eq. (12.102) to eliminate one term and Eq. (12.94) to eliminate 
another to the effect that 


term2 = tr€(LV)(LV)’ 
= trLRL’, 
and using Eq. (12.100) for L, we have 
term2 = tr LR{R A(A7R!A)!] 
= trLA(A7R™'A)"!, 


but now from Eq. (12.102) we see that the term is zero. Thus we reduce 
Eq. (12.103) to 


4(9) = w£(0 — ôO — 6)” + 7 (6). 


Because the first term on the right is the sum of expected values of squares, it 
is zero or positive. Thus .7(8) > J (0), and we have proved that Eq. (12.101) is 
really and truly BLUE. 

To conclude this section on stochastic least squares, we summarize our 
findings as follows. If the data are described by 


Y = A0? + V, 
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and 
EV=0, — EVV- RR 
then the least-squares estimate of 0? is given by 
6, SATA ATY, (12.104) 


which is unbiased. If R = o7I, then the variance of 0, ., which is defined as 


E(6,,—0°)(6,, — 057, is 


LS? 


var(0,.) = a^(ATA)"!. (12.105) 


From this we showed that the least-squares estimation of a constant in noise is 
not only unbiased but also consistent. 

Furthermore, if we insist that the estimate be both a linear function of the 
data and unbiased, we showed that the BLUE is a weighted least squares given 
by 


6, = (A"R^'A)'ATR^'Y. (12.106) 
The variance of 6, is 


var (6 ,) = EÔ, — 6°)(6, -097 
= (ATR^! A)". (12.107) 


Thus, if R = o7I, then the least-squares estimate is also the BLUE. 
As another comment on the BLUE, we note that in a recursive formulation 
we take, according to Eq. (12.63) 


P = (A"R^"A)", 


and so the matrix P is the variance of the estimate 6 g- In the recursive equations 
given for vector measurements in Eq. (12.75), the weight matrix a becomes R;'!, 
the inverse of the covariance of the single time measurement noise vector and, of 
course, y = |. Thus the BLUE version of Eq. (12.75) is 


L(V +1) 2 P9(R, + b'P)’. 


Thus far we have considered only the least-squares estimation where A, 
the coefficient of the unknown parameter, is known. However, as we have seen 
earlier, in the true identification problem, the coefficient of 6 is 6, the state of the 
plant model; and if random noise is present, the elements of ^ will be random 
variables also. Analysis in this case is complex, and the best we will be able 
to do is to quote results on consistency and bias before turning to the method 
of maximum likelihood. We can, however, illustrate the major features and the 
major difficulty of least-squares identification by analysis of a very simple case. 
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Suppose we consider a first-order model with no control. The equations are taken 
to be 


y(k) = a®y(k — 1) + v(k) + cwk — 1), 

Ev(k) =0, 
o? k=j 
Qi "Ex. 
In general, we would not know either the constant c or the noise intensity c?. We 
will estimate them later. For the moment, we wish to consider the effects of the 
noise on the least-squares estimation of the constant a°. Thus we take 

Y =[y(1)--- (NI, 

$ = [y(0)---y(N — DI’, 


O=a; 


Ev(k)u(j) = (12.108) 


Then we have the simple sums 
N 


N 
7b =) yk-Dyk-1, Y=) yk- Dy), 
k=l 


k=l 


and the normal equations tell us that 6 must satisfy 


N N 
» nli- Ye — 1)y(k). (12.109) 
k=) k=l 


Now we must quote a result from statistics (see Appendix D). For signals such as 
y(k) that are generated by white noise of zero mean having finite intensity passed 
through a stationary filter, we can define the autocorrelation function 


R G) = &y()y(k + J) (12.110) 


and (in a suitable sense of convergence) 
l N 
im — i) = R (j). i 
dim y 2. OE + D = RO) (12.111) 


Thus, while the properties of Ê for finite N are difficult, we can say that the 
asymptotic least-squares estimate is given by the solution to 


N N 
Jim. (2ra 0 ) = im p Èa- 1)y(k), 
which is 
R,(0)6 = R,(1), 
6 = R,(1)/R, (0). (12.112) 
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If we now return to the model from which we assume y(K) to be generated, values 
for R (0) and R.) can be obtained. For example, if we multiply Eq. (12.108) 
by y(k — 1) and take the expected value, we find 


£y(k — 1)y(k) = Ea®y(k — 1)y(k — 1) + £v(k)y(k — 1) 
+cEv(k — 1)y(k — 1), 
R,(1) =a°R (0) + cEv(k — 1)y(k — 1). (12.113) 
We set Ev(k)y(k — 1) = 0 because y(k — 1) is generated by v(j) occurring at 
or before time k — 1 and y is independent of (uncorrelated with) the noise v(j), 


which is in the future. However, we still need to compute Ev(k — 1)y(k — 1). 
For this, we write Eq. (12.108) for k — 1 as 


y(k — 1) =a®y(k — 2) + wk — 1) + cv(k — 2) 
and multiply by v (k — 1) and take expected values, 


Ey(k — Dw(k — 1) = Ea? y(k — 2)u(k — 1) + Ev? (k — 1) + cEv(k — 2)v(k — 1) 


=o’, 


We have, then, 


R,(1) =@°R,(0) + co’. (12.114) 
Now we can substitute these values into Eq. (12.112) to obtain 
^ R (1) 
oo) = 2 
R, (0) 
0 
= , 12.115 
a ^c R O) ( ) 


y 


If c =0, then (oo) = a, and we can say that the least-squares estimate is 
asymptotically unbiased. However, if c = 0, Eq. (12.115) shows that even in this 
simple case the least-squares estimate is asymptotically biased and cannot be 
consistent.'? Primarily because of the bias introduced when the ARMA model 
has noise terms correlated from one equation to the next, as in Eq. (12.108) 
when c = 0, least squares is not a good scheme for constructing parameter 
estimates of dynamic models that include noise. Many alternatives have been 
studied, among the most successful being those based on maximum likeli- 
hood. 


12 Clearly, if the constant c is known we can subtract out the bias term, but such corrections are not very 
pleasing because the essential nature of noise is its unknown dependencies. In some instances it is 
possible to construct a term that asymptotically cancels the bias term as discussed in Mendel (1973). 
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12.6 Maximum Likelihood 


The method of maximum likelihood requires that we introduce a probability 
density function for the random variables involved. Here we consider only the 
normal or Gaussian distribution; the method is not restricted to the form of the 
density function in any way, however. A scalar random variable x is said to have 
a normal distribution if it has a density function given by 


1 bs 2 
f,@ = ae |l; S|. (12.116) 


It can be readily verified, perhaps by use of a table of definite integrals, that 
a f,(d£&- 1, 
Ex= | sf eain. 


£e - uy» f (£ — u’ f, EdE 2 0°. (12.117) 


The number c? is called the variance of x, written var(x), and c is the standard 
deviation. For the case of a vector-valued set of n random variables with a joint 
distribution that is normal, we find that if we define the mean vector p and the 
(nonsingular) covariance matrix R as 


€x = p 
E(x- m(x- u) = R, 
then 
a a v ite — 
f,G = [Qx y det RJ? exp | "AS p) R^ (£ w|. (12.118) 


If the elements of the vector x are mutually uncorrelated and have identical means 
uand variances c?, then R = oI, detR = (o?)", and Eq. (12.118) can be written 
as 


l 1 < 
f£, = Ono? E» (12.119) 
i=l 


Because the normal distribution is completely determined by the vector of means 
i. and the covariance matrix R, we often use the notation N (m, R) to designate 
the density Eq. (12.118). 

The maximum-likelihood estimate is calculated on the basis of an assumed 
structure for the probability density function of the available observations. Sup- 
pose, for example, that the data consist of a set of observations having a density 
given by Eq. (12.119), but having an unknown mean value. The parameter is 
therefore 6° = u. 
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The function? 


n _py 
f,(€|0) = (2107)? exp |- : M d (12.120) 


izl 


can be presented as a function of 0, as giving the density of a set of x, for any 
value of the population mean 0. Because the probability that a particular x, is in 
the rangea < x, x b is given by 


b 
Pda <x, <6)= f f, 6,1048, 


the density function is seen to be a measure of the "likelihood" for a particular 
value; when f is large in a neighborhood x°, we would expect to find many 
samples from the population with values near x°. As a function of the parameters, 
0, the density function f. (& | 9) is called the likelihood function. If the actual data 
come from a population with the density f (ë | 0°), then one might expect the 
samples to reflect this fact and that a good estimate for @° given the observations 
x — Dues 1j would be 0 = 60, where 0 is such that the likelihood function 
f, (x19) is as large as possible. Such anestimate is called the maximum-likelihood 


^ 


estimate, 0,, . Formally, 6,, is such that 
f, (x19, > f,(x10). (12.121) 


From Eq. (12.120) we can immediately compute Ó,. for the mean p, by 
setting the derivative of f (x|8) with respect to 6 equal to zero. First, we note 


that 
d 1 df 
f duh RE i 12.122 
Fr CT ) 


so that the derivative of the log of f is zero when df/d@ is zero." Because the 
(natural) log of the normal density is a simpler function than the density itself, 
we will often deal with the log of f. In fact, the negative of the log of f is used 
so much, we will call it the log-likelihood function and give it the functional 
designation 


—log f, (x|0) = £(x | 0). 
Thus, from Eq. (12.120), for the scalar parameter jz, we have 
n 14 6, - 9)’ 
(x10) = +5 log xo) + 75 ^ ——— (12.123) 


13 We read f,(&|6) as “the probability density of § given 0." 
14 It is not possible for f to be zero in the neighborhood of its maximum. 
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and 
ae ] c 
aco n a d 
m 351258 ! 


=- h [E0]. (12.124) 
a 


i=] 


If we now set 84/80 = 0, we have 


or 
^ Ll< 
0 =— > A (12.125) 


We thus find that for an unknown mean of a normal distribution, the maxi- 
mum-likelihood estimate is the sample mean, which is also least squares, un- 
biased, consistent, and BLUE. We have studied this estimate before. However, 
we can go on to apply the principle of maximum likelihood to the problem of 
dynamic system identification. 

Consider next the ARMA model with simple white-noise disturbances for 
which we write 


yk) = —aytk — 1) — -= — a y(k —n) 
+ bulk — 1) +--+ +b,ulk — n) + v(k), (12.126) 


and we assume that the distribution of V = [v(n) ... v(N)]? is N(0, 071). Thus 
we are assuming that the v(k) are each normally distributed with zero mean and 
variance c? and furthermore that the covariance between v (k) and v(j) is zero for 
k + j. Suppose, now, that a sequence of the y(k) are observed and that we wish to 
estimate from them the a,, b,, and c? by the method of maximum likelihood. We 
require the probability-density function of the observed y(k) for known values 
of the parameters. From a look at Eq. (12.126 ), it is apparent that if we assume 
that y, u, a,, and b, are all known, then we can compute v(k) from these observed 
y and u and assumed (true) a, and b,, and the distribution of y is immediately 
determined by the distribution of v. Using the earlier notation, we define 


V(N) = [v(n)...0(N)]", 
Y(N) =[y(n)... y(N)]', 
p(k) = [-y(k — D... — y(k 2 nu(k — 1) ...u(k — n)]", 
$(N) = [6(»)... 6(N)]', 
LR I EN OF a NDS (1201597) 
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Then Eq. (12.126) implies, again for the true parameters, that 
Y(N) = $0? + V(N). (12.128) 


Equation (12.128) is an expression of the input-output relation of our plant 
equations of motion. To obtain the probability-density function f (Y | 0°) as 
required for the method of maximum likelihood, we need only be able to compute 
the v(K) or in batch form, the V(W), from the y(k) or Y(N) because we are given 
the probability density of V. To compute V from Y requires the inverse model 
of our plant, which in this case is so trivial as to be almost missed; namely, we 
solve Eq. (12.128) for V to obtain 


V(N) = Y(N) — $(N)0?, (12.129) 


Because we have assumed that the density function of V is N(0, o7I), we can 
write instantly 
1 (Y — $05"(y — oo? 
fO 1,5 = Qual) "^ exp EE i (12.130) 
o 
where m = N — n + 1, the number of samples in Y. 
The likelihood function is by definition f (Y | 60), whichis to say, Eq. (12.130) 
with the 0? dropped and replaced by a general 0. As in the elementary example 
discussed above, we will consider the negative of the log of f as follows 


Ulo b9) (Y — $0) | | 
2 o? 
1 (Y — $0)” (Y — 90) 


=) eshte oe bos NST 
= Bee y 88 2 = i ; 


Our estimates, Ôu and G?,, , are those values of 0 and o? which make £(Y | 0) as 
small as possible. We find these estimates by setting to zero the partial derivatives 
of £ with respect to @ and o°. These derivatives are (following our earlier treatment 
of taking partial derivatives) 


Je i 
80 o 


&(Y|9) = —log 2x0?) "^? — log [eo |- 


ID dÂ n —o'Y] 20, (a) 


ee ^ Q5) (12.132) 


where the quadratic term Q is defined as 
^ T A 
Q-(Y-40,)(Y-40,). (© 
15 We deliberately formulated our problem so that this inverse would be trivial. If we add plant and 


independent sensor noise to a state-variable description, the inverse requires a Kalman filter. Because 
v(k) is independent of all past u and yMor any 9. e(k) is the prediction error. 
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We see immediately that Eq. (12.132) is the identical “normal” equation of the 
least-squares method so that 0,, = 0,. in this case. We thus know that Oy; 
is asymptotically unbiased and consistent. The equations for ei decouple from 


those for Ôu and solving Eq. (12.132(b)), we obtain (again using earlier notation) 
1 ^ a 
c= B —(Y - 0) (Y — 6) 
m m 
l ` à 
= — 2(k: ®). 12.133 
a 84) (12.133) 


Thus far we have no new solutions except the estimate for c? given in 
Eq. (12.133), but we have shown that the method of maximum likelihood gives 
the same solution for 0 as the least-squares method for the model of Eq. (12.126). 
Now let us consider the general model given by 


y(k) = - May — i) Y bk 2i) Y auk i) vlk); (02.134) 
i=l i= 


i=l i=l 


the distribution of V(N) = [v(n) ... v(N)]! is again taken to be normal, and 
V is distributed according to the (0, c?I) density. The difference between 
Eq. (12.134) and Eq. (12.126), of course, is that in Eq. (12.134) we find past 
values of the noise v(k) weighted by the c; and, as we saw in Eq. (12.115), the 
least-squares estimate is biased if the c; are nonzero. 

Consider first the special case where of the c's only c, is nonzero, and write 
the terms in Eq. (12.134) that depend on noise v(k) on one side and define z(k) 
as follows 


v(k) e v(k — 1) = y) + 3a y(k i) - Y bulk- i) 
i=l i=l 


= z(k). (12:135) 


Now let the reduced parameter vector be ð= lgd, "Dou. Bes composed 
of the a and b parameters but not including the c,. In a natural way, we can write 


Z(N) = Y(N) - 9. (12.136) 


However, because z(k) is a sum of two random variables v(k) and v(k — 1) , each 
of which has a normal distribution, we know that Z is also normally distributed. 
Furthermore, we can easily compute the mean and covariance of z 


Ez(k) E(v(k) +c,v(k — 1) =0 for allK, 
Ez(k)z(j) E(v(k) +c, v(k — IDEG) t cv — 1) 
o?^(14 cj) (k= ]) 
oc  (k2j-D 
ee (k=j+!) 

0 elsewhere. 
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Thus the structure of the covariance of Z(N) is 


R = £Z(N)Z'(N) 


lc c, 0 0 g 
6 iif @ € d 
- 0 a led we | a, (12.137) 
0 0 sa “ee, 
0 0 e, lc 
and, with the mean and covariance in hand, we can write the probability density 
of Z(N) as 
l 
g(Z(N) | 0°) = ((27)” det R)” "? exp(— 3Z R™'Z), (12.138) 


where m = N — n + 1. But from (12.136) this is the likelihood function if we 
substitute for Z and 0 as follows: 


i = E 
f(Y10) = [(2z)" det RJ"? exp[- ries —09)'R"(Y—490). (12.139) 
The negative of the log of f is again similar in form to previous results 
l ] E: = 
£(Y|0) = 7 log 2n + 2 log(det R) + zY — p0) R (Y — $0). (12.140) 


The major point to be made about Eq. (12.140) is that the log likelihood 
function depends on the a's and b’s through 0 and is thus quadratic in these 
parameters, but it depends on the c's (c, only in this special case) through both R 
and dert R, a dependence that is definitely not quadratic. An explicit formula for 
the maximum-likelihood estimate is thus not possible, and we must retreat to a 
numerical algorithm to compute 0,,, in this case. 


Numerical Search for the Maximum-Likelihood 
Estimate 


Being unable to give a closed-form expression for the maximum-likelihood esti- 
mate, we turn to an algorithm that can be used for the numerical search for 0,,,. 
We first formulate the problem from the assumed ARMA model of Eq. (12.134), 
once again forming the inverse system as 


v(k) = yk) + $ ayk-  - 3i bu(k-i) Y coki) (12.141) 
i=] isl i=l 


By assumption, v(k) has a normal distribution with zero mean and unknown 
(scalar) variance R,. As before, we define the successive outputs of this inverse as 


V(N) = [v(n) v(n +!) v(N)]’. (12.142) 
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This multivariable random vector also has a normal distribution with zero mean 
and covariance 
R- RI, 
= EVV’, (12.143) 


where I, is the m x m identity matrix, and m = N —n + l is the number of 
elements in V. Thus we can immediately write the density function as a function 
of the true parameters 0° = (AE. e EE ae e ^ as 


m m i l - v^(k) 
f(VI8) = (Qx)" R?) "exp E» | (12.144) 


The (log) likelihood function is found by substituting arbitrary parameters, 0, 
in Eq. (12.144) and using e(k) as the output of Eq. (12.141) when 0 = 6°, then 
taking the log which gives the result 


€(E|0) = M log2n + M log R + ae 5 27) (12.145) 
2 2 " ERAI 
As with Eq. (12.134) we can compute the estimate of R, by calculation of 
9£/8R, — 0, 
which gives " 
^ l : 
Ree ET »20 (12.146) 


To compute the values for the a. D, and c,, we need to construct a numerical 
algorithm that will be suitable for minimizing £(E|0). The study of such algo- 
rithms is extensive; we will be content to present a frequently used one, based 
on a method of Newton. The essential concept is that given the K th estimate of 0, 
we wish to find a (K + 1)st estimate that will make €(E|@(K + 1)) smaller than 
£(E|0( K)). The method is to expand £ about 0( X) and choose 0(K + 1) so that 
the quadratic terms—the first three terms in the expansion of £—are minimized. 
Formally, we proceed as follows: Let 0(K + 1) = 0(K) + 60. Then 


£(E | 6(K + 1)) = KE]6(K) + 56) 


ee UM 
=c+g780 + 530 QÔ +, (12.147) 
where 
n ag 9?£ 
-WRI ev == 2o Qa 12.148 
c —(EJ0(K), g Q 3000 | . ( ) 
0=60(K) 9z8(K) 


16 See Luenberger (1973) for a lucid account of minimizing algorithms. 
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We must return later to the computation of g” and Q, but let us first construct 
the algorithm. We would like to select 50 so that the quadratic approximation to 
£ is as small as possible. The analytic condition for £ to be a minimum is that 
90£/880 = 0. We thus differentiate Eq. (12.147) and set the derivative to zero with 
the result (ignoring the higher-order terms in 50) 

ae 7 7 
359 ^ +80 Q 
zz 0; (12.149) 
Solving Eq. (12.149) for 60 we find 
60 = —Q^!g. (12.150) 
We now use the $0 found in Eq. (12.150) to compute 6(K + 1) as 
O(K + 1) = 6(K) — Q^'g. 
In terms of £ as given in Eq. (12.148), the algorithm can be written as 
aK +1) = 6K) - (24 "(EY (12.151) 
= 90 00 800/ ` f 


Our final task is to express the partial derivatives in Eq. (12.151) in terms 
of the observed signals y and u. To do this, we return to Eq. (12.145) and 
proceed formally, as follows, taking R, to be a constant because R, is given by 


Eq. (12.146) 
ELD = : ELLE posi e zc ew 
= XC (12.152) 
9^£ A ie Bee de(k) 
0000 30 FOX 30 


1 “(orth \* dae). 1 d7e(k) 
=e ` bh = ) ba (12:15 
R k=n 80 30 : R, kon d 90 90 i » 


We note that Eq. (12.152) and the first term in Eq. (12.153) depend only on the 
derivative of e with respect to 0. Because our algorithm is expected to produce an 
improvement only in 0, and because, near the minimum, we would expect the first 
derivative terms in Eq. (12.153) to dominate, we will simplify the algorithm to 
include only the first term in Eq. (12.153). Thus we need only compute ðe(k)/30. 
It is standard terminology to refer to these partial derivatives as the sensitivities 
of e with respect to 0. 
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We turn to the difference equations for v(k) Eq. (12.139), substitute e(k) for 
v(k), and compute the sensitivities as follows: 


de(k) _ 4 x ee) 
Um 730-9 DAT m (a) 
de(k) _ eee) 
m = —u(k — i) — D am (b) 
dekh) Bek) 
x ek) MA e - (c) — (12.154) 


i j=l i 


If we consider Eq. (12.154(a)) for the moment for fixed i, we see that this is a 
constant-coefficient difference equation in the variable de(k)/da, with y(k — i) as 


a forcing function. If we take the z-transform of this system and call de/da, = 5, 
then we find 


E, (2) — z"'Y() — » cz EQ) 


j=l 
Or 


zr 
Bee e M 
= rupe 


Thus the derivative of e with respect to a, is simply z^! times the partial derivative 
of e with respect to a,, and so on. In fact, we can realize all of these partial 
derivatives via the structure shown in Fig. 12.9. 

In exactly analogous fashion, dynamic systems, whose states are the sensitiv- 
ities of e with respect to b, and c,, can be constructed. With these then, we have all 
the elements of an explicit algorithm that can be used to compute improvements 
in 8. The steps can be summarized as follows: 


1. Select an initial parameter estimate, 6(0), based on analysis of the physical 
situation, the system step and frequency responses, cross-correlation of input 
and output, and/or least squares. 


2. Construct (compute) e(k) from Eq. (12.139) with e substituted for v; com- 
pute the sensitivities from Eq. (12.154) using three structures as shown in 
Fig. 12.9; and simultaneously compute R 32/30 from Eq. (12.152) and the 
first term of R 876/80 30 from Eq. (12.153). 


3. Compute R, from Eq. (12.146) and solve for g and Q. 
4. Compute 6(K + 1) = 6(K) — Q^'g. 
If [Ê (k +1)- Ê KY Ê (K) < 1074, stop. Else go back to Step 2. 


tn 
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Figure 12.9 


Block diagram of dynamic system whose states are the sensitivities of e with respect to a, 


In Step 5 of this algorithm it is suggested that when the sum of squares of 
the prediction errors, as given by R,, fails to be reduced by more than a relative 
amount of 1075, we should stop. This number, 1074, is suggested by a statistical 
test of the significance of the reduction. A discussion of such tests is beyond 
our scope here but can be found in Astrom and Eykhoff (1971), and Kendal and 
Stuart (1967). If the order of the system, n, is not known, this entire process can 
be done for n = 1,2, 3, ... , and a test similar to that of Step 5 can be used to 
decide when further increases in n are not significant. 


€ Example 12.5 


Identification of the Four Disk System 


To illustrate parameter identification with a non-trivial example, we consider the four disk 
mechanical set-up sketched in Fig 12.10 as studied by M. D. Sidman (1986). This is a repre- 
sentation of a physical system constructed in the Aeronautic Robotics Laboratory at Stanford 
University as part of a continuing study of the control of flexible mechanisms. In this case, 
the assembly has a torque motor coupled to disk 3 and angle sensors at disk 3 and at disk 
1. The sensor at disk 3 is therefore collocated with the torque source and the sensor at disk 
1 is non-collocated with the torque. All disks have the same inertia but, to permit parameter 
changes, provision is made to reduce the inertia of disk 4 by a factor of 0.5 or 0.25. Careful 
measurement of the transfer function by a one-frequency-at-a-time technique for each of the 
inertias of the disk was done and the magnitudes are plotted in Fig. 12.11. The problem is to 
estimate the transfer function by stochastic least squares. 


Figure 12.10 
A four disk system, from 
Sidman (1986) 


Figure 12.11 

Frequency response for 
the four disk system 
measured with no noise 
and non-collocated 
signals, from Sidman 
(1986) 
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Figure 12.12 
Frequency response of 
the four disk system 
found by least squares, 
from Sidman (1986) 


Figure 12.13 
Frequency weighting 
caused by the system 
denominator, a(z), from 
Sidman (1986) 
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Solution. If we write the 6, (z)/ U(z) transfer function as a ratio of two eight-degree poly- 
nomials, excite the system with a random signal, compute the least squares estimates of these 
parameters, and plot the resulting experimental frequency response, we obtain the curves 
shown in Fig. 12.12. 

The poor fit obtained by using least squares in the presence of very small data converter 
quantization noise, W_, at the plant output is obvious in Fig. 12.12. Because the flexible 
modes are not successfully identified, the estimates are worthless for the purpose of designing 
a controller that actively dampens these modes. The poor performance of least squares in 


Gain magnitude 
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Normalized frequency (radians) 
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this situation is caused by the fact that the equation error is not white. It is colored by the 
characteristic polynomial of the plant, a(z). Suppose we write the transfer function 


ro) = P yey + w. (12.155) 
a(z) 
Then the equation error is 
a(z)Y(z) — b(z)U(z) = a(z)W,. (12.156) 


Because a(z) is typically large at high frequencies, and amplifies the noise, the parameter 
estimates will be greatly distorted by the least squares identification. This has the effect of 
corrupting the frequency response estimates as shown in Fig. 12.12. In the present case, a 
plot of the true a(z) is shown in Fig. 12.13. If we filter the data with a(z) as suggested by 
Eq. (12.156), we obtain the excellent results shown in Fig. 12.14, However, this polynomial is 
not known and cannot be used to reduce the high frequency distortion. 

To reduce this effect, Sidman (1986) modified Clary's method and introduced the known 
parts and filtered the data with the structure shown in Fig. 12.15. The plant is divided into 
known and unknown parts, multiplied on both sides of Eq. (12.156) by the filter transfer 
function, F, and the new variables defined as 


U (2) = Ut) 4) Fee), (12.157) 
a, (2) 
Y,(z) = ¥(z) F(z). (12.158) 
Then the filtered least-squares formulation 
a, (z)[a, (2) ¥(z)] — b, (z)[b, (2) U(2)] = a, (z)a, (2) W, (12.159) 
reduces to the modified formulation 
a, (z) Y,(z) — b, (z)U,(z) = a, (2) F(z) W, (12.160) 


From these equations, one estimates the transfer function polynomials a, and b, of the un- 
known plant. However, as can be seen by comparing the right-hand sides of Eq. (12.159) and 
Eq. (12.160), this eliminates the coloration of the equation error due to the known a, and 
introduces a filtering of the noise that can be used to reduce the weighting due to a, in the 
estimates. In this example, a, is a very significant cause of coloration at high frequencies since 
it includes the action of two roots at z = | corresponding to the plant's rigid body poles. 

The bandpass filter F(z) is selected to have a passband that brackets the region where the 
resonances are known to exist, This frequency constraint prefilter further reduces the effect of 
high and low frequency noise and disturbances including broadband quantization noise, DC 
bias in the sensor and actuator, low-frequency torque disturbances, as well as excited unmodeled 
dynamics above the frequency band of interest, The high pass portion of the bandpass filter 
may be chosen to have its zeros at z — 1. This leads to a stable, simple combined prefilter that 
lacks undesirable low frequency behavior. In this example, b, consists of real zeros located on 
the negative real axis. The contribution to the system response due to the zeros of b, is very 
insensitive to changes in the inertia or torsional stiffness in the plant. Thus, the identification 
need only estimate one complex zero pair and three complex pole pairs of the four disk system. 

With the suggested pre-processing of the input and output data, the results of Fig. 12.16 
are obtained. This model is quite suitable for use in control or in adaptive control schemes, 
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Figure 12.14 
Frequency response 
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Figure 12.16 
Frequency response of 
four disk system using 
least squares with 
low-pass filters and 
known factors, from 
Sidman (1986) 
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Subspace Identification Methods 


The identification methods described thus far have been based on transfer function 
models. The nonparametric methods estimate the transfer function directly as a 
function of frequency and the parametric methods are based on representing 
the input-output relation as a rational function with unknown coefficients for 
the numerator and denominator polynomials. An alternative formulation of the 
identification problem is to express the input-state-output relationships in state 
space form with unknown description matrices ®, I’, H, and J. In addition to 
these matrices, the (unknown) state sequence, x(k), is introduced as well. At first 
blush, it would seem that this structure greatly complicates the problem as there 
are many more parameters to be considered as compared to the transfer function 
model. The approach is to express the data in an expanded space guaranteed 
to contain the state description and to extract the subspace consistent with the 
state equations using reliable and robust methods of numerical linear algebra. 
The methods have been to shown to have very desirable statistical properties 
and, perhaps most important, to be directly applicable to multivariable systems 
with vector inputs and outputs. The subspace-based methods have their origins 
in the early work toward finding a state realization from a deterministic impulse 
response as described in Ho (1966). A unification of the field has been given 
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in Overshee (1995) where the method described here (based on Cho (1993)) is 
called a prediction method. 

The method begins with a collection of input and output data u(k) and y(k) 
that we assume are scalars but could just as well be vectors for this method. To 
motivate the development of the algorithm, it is assumed that this data comes 
from a noise-free underlying state system given by equations of the form 


x(k + 1) = x(k) + Pu(k) 
y(k) = Hx(k) + Ju(k) (12.161) 
where the (unknown) dimension of the state is n . In order to express the constraints 
imposed on the input and output sequences by Eq. (12.161), a number, M, known 


to be larger than the state dimension but much smaller than the data length, is 
selected and the data are organized into the matrices Y and U of size M x N as 


follows 
Xo Xp "SE ur. 
y y wee y 
21-3 P ME  “" (12.162) 
Yuen Wa 7 XYuaar-a 
and 
Uy uy Uy- 
u, u, Uy 
U= f ] (12.163) 
Uy) Uy ` Unan—o 


We assume that the input signals are "exciting" in the sense that u(k) contains 
many frequencies and, in consequence, the matrix U has full rank = M. To these 
data matrices we add the n x N matrix of states 


|| ter Xy | (12.164) 
Finally, we define composite matrices made from the description matrices as 
H 
Ho 
Oo =|. (12.165) 
Ho" 
and 
J 0 0 0 0 
Hr J 0. 0 0 
H=| HOP HF J 0 0 (12.166) 


HPT HOMT J 
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The M x n matrix Ó, is the observability matrix with n independent columns and 
the M x M matrix ‘H is an impulse response matrix. With these constructions 
of the data and definitions of matrices, the input-state-output relations can be 
compactly expressed as 


Y — O.X - "KU, (12.167) 


and the unknown model is contained in the matrices ©, and H. 

The next step is to remove the term in H by multiplying on the right by a 
matrix that is perpendicular to U, which can be found by computing the singular- 
value decomposition (S VD) of U partitioned as follows 


T 
U-[P, Pa ][ 3, o]| g | =P, X Qj. (12.168) 


It is a property of the SVD that the matrices P, and Q, are orthogonal matrices 
and consequently that the product Q7,Q_, = 0. Thus if we multiply Eq. (12.167) 
on the right by Q, it is reduced to 


YQ, = O,XQ,,. (12.169) 


u2 


The matrix on the right side of Eq. (12.169) is a combination of the columns 
of the observability matrix in the space of the state x and has n independent 
columns. These can be identified by again taking an SVD, this time of YQ, by 
computing matrices P, S, and Q so that 


5 

YQ, = PSQ’ - [ P, Pall oll ar |: (12.170) 
The number of independent columns in this product are the columns of P, and 
equals the number of nonzero singular values in S which is the size of & which 
is therefore by definition n x n. Comparing this result with Eq. (12.169) means 
that we can take P, = O, from which we can solve for H and ®. If we wish 
to consider another state than x via a transformation T according to x = Tz we 
would have O, =O, T and P, = O.T"'. 

With O, given by Eq. (12.165), the matrix H is the first row. "7 We next solve 
for ® using the properties of © given by Eq. (12.165). If we drop the last row of 
O. and call that reduced matrix O, and drop the first row and call that ©, then 
itis clear that O, = O.® from which we can solve for 

©=[0,]'o (12.171) 


— ? 


where [.]' represents the pseudo inverse and computes a least squares solution. '? 


17 In the multivariable case, we'd take H as the first n; rows. 


18 Some authors suggest using total least squares rather than ordinary least squares, as both CO. and O, 
are subject to noise and uncertainty in real cases. 
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Having H and ®, we now turn to computing F and J by operating on 
Eq. (12.167) to isolate +t. To do this we multiply Eq. (12.167) on the right by 
the pseudo inverse Ut (which is readily found from Eq. (12.168)) and on the left 
by P7,. Since by definition UU' = I and P,O, = P7,P,, = 0, these operations 
reduce the equation to 


ul 


PLYU' = PZH. C2472) 


Because I‘ and J appear on the right in H in a linear fashion, we can solve for 
them by rearranging the terms in the equation. We partition the matrices in this 
equation consistent with H and write it as 


J 0 > 0 
HT J => 0 
[K i : 


ox 9 Ky ] = [5i o sity b] 


Hr ... HF J 
(12.173) 


To isolate the unknowns J and F these equations can be reorganized and written 
as 


K, Py Py coo Py 

K, Pa cop. 0 10 J 
she: eel] «ne 
Ke Pu 0 e 0 


From this equation, one can solve for J and I by, for example, least squares 
again. This completes one variation of the prediction subspace identification 
algorithm. Clearly there are a substantial number of choices the engineer needs 
to make, including the selection of M and the selection of least squares or total 
least squares to solve Eq. (12.171) for ® and Eq. (12.174) for F and J. Perhaps 
more important than these decisions is the choice of n from Eq. (12.170). As 
given, we have ignored the possibility of noise which is always present in any 
practical case. Thus in practice the dimension of X in Eq. (12.170) is not obvious 
but must be selected as that value such that the significant singular values are 
kept and the negligible singular values are dropped. How clearly this choice is 
depends on the particular case and the other choices made in the algorithm and 
can be quite difficult. In important cases, models based on the prediction error 
method as well as one based on subspace methods are frequently computed and 
tested against new data and physical knowledge to guide final model selection. 


Summary 


In this chapter we have introduced some of the concepts of identification of 
dynamic systems for the purposes of control design. The main points were 
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Data to be used for identification should be filtered to reduce noise and to 
remove known components of the transfer function. 

A good frequency response model can be computed using sinusoidal inputs, 
one frequency at a time. 

A chirp signal input coupled with the FFT permits one to obtain a frequency 
response model in one pass. 

Either batch or recursive least squares gives a good model when the equation 
error is an independent white noise process. 

Prediction error methods based on the maximum likelihood are among the 
best general methods for identification. 

State subspace-based methods estimate state realizations directly and are 
especially effective for multi-input-multi-output systems. 


12.10 Problems 


Figure 12.17 
A mechanical system 
used in Problem 12.2 


12.1 Data from a mass moving in a plane (x, y) without force are given as 


e 


—0.426 1.501 
0.884 1.777 
2.414 2.530 
2.964 3.441 
3.550 3.762 
5.270 4.328 
5.625 4.016 
7.188 5.368 
8.129 5.736 
9.225 6.499 

10 10.388 6.233 


OMmMAYNDNHWN 


Plot the points and use least squares to estimate the initial value of x, the initial value of 
y, and the velocity of the mass in x and in y. Draw the curve (line) that corresponds to 
your estimates. 


12.2 A simple mechanical system without friction is sketched in Fig. 12.17. 
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12.3 


12.4 


(a) 


(b) 


(c) 


(d) 


(e) 


(f) 


(g) 


(h) 


Show that this system is described by the equations 
Mi, +k x, +k (x, — 2x) 2-0, x00 = xo 
MX, +k x, +k, (x, — x) = 0 x, (0) = Xe 
We wish to estimate the parameters k,,k,, and M from measurements of x, (r). 


However, note from (a) that if we divide by M in each equation the only identifiable 
parameters are k,/M = K, and k,/M = K,. Give an intuitive argument to the 


effect that, if x,, = x9 or if x,y = —x9, it is impossible to estimate both K, and 
K, from x, (£). (Hint: Rewrite the equations using y = x, — x, and z = x, + x, as 
the position variables.) Show that if Xy = l and Xo = 0, it should be possible to 


estimate both K, and K, from x, (£). What do you conclude makes a “good” initial 
condition? 


Compute a discrete equivalent system for sampling at 0.5 sec with an input that 
can set an initial condition on x,. Select K, — 0.4 and K, — 0.6. Compute the 
transform of X, if xj, = 1 


baz + biz? + bz +b,z + b, 


Xz) = , 
z +a z? taz -azta, 


where the parameters a, are functions of K,, K,, and sampling period T. 


Formulate the least-squares problem to estimate the a, from noisy measurements 
of x (kT). Set up the problem so that it is possible to ignore the 5;. 


Simulate the equations of part (a) for K = 0.4 and K, = 0.6, sample x, at arate of 
0.5 sec for 30 samples, and compute 6= (à à, à nu. Assume all the noise 
comes from an A/D converter operating at 10 bits including sign. If the simulation 
is done on a digital computer, add the appropriate noise according to the noise 

model of quantization discussed in Chapter 10. Compute the predicted variance in 


the estimate of 0. 


Deliberately add additional noise of known variance to your data, recompute 50 
times from different runs of noisy data, and compare the sample variance of 0 to 
the theoretical value based on P. 


Keep the number of samples fixed at 30 but vary the sample rate from 0.1 sec to 1 
sec in 0.1 steps and compute 


l 4 


12 


k=l 


à o 
0, — 9, 
0; 

for each sample period as a measure of total estimate accuracy. What do you 
conclude respecting selection of sample period for identification? 


Keep the sample period fixed at 0.5 sec, but compute the estimate for varying 
numbers of samples. Consider at least 5, 10, 30, 100, and 300 samples. Compare 
the same criterion used in part (g) and give an explanation of the results. 


Program the recursive least-squares algorithm of Eq. (12.73) for no weighting. Use the 
algorithm to estimate the parameters of the system of Problem 12.2(h). Use 0(5) = 0 
and P(5) = I for your initial conditions. If available, use the Identification Toolbox to 
repeat the exercise. 


Give the transfer function H (z) of the filter that corresponds to exponentially weighted 
least squares as a function of a and y. Prove that H(1) = l ifa =1-— y. 


12.5 


12.6 


12.7 


12.8 


12.9 
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In Eq. (12.84) we showed that the variance in the least-squares estimate is 
(A’ A)! AT RA(A™A)~! and, in Eq. (12.108), we showed that the variance of the BLUE 
is (A7 R^! A)7!. Use the development of the proof of the BLUE following Eq. (12.103) 
to devise an expression forthe excess variance of least squares over BLUE. 


(Contributed by N. Gupta.) Show that the least-squares estimate 6 = (A7A)"'ATY 
results in the error squared E(Y — A8)" (Y — A8) = (m — n)c?, where Ee? = o?, Y 
has m components, and 60 has » components. What estimate of c? does this result 
suggest? Hint: If M = I — A(ATA)*!A7, then M? = M = M7. 
In Eq. (12.134) we showed that the maximum likelihood estimate of c? is 
(Y — A8)" (Y — AÔ)/m. 
Show that this estimate is biased. 


Write a computer program to implement the search for the maximum likelihood estimate 
following the method of Section 12.8. 


Simulate the system [described in Astrom and Eykhoff (1971)] 
T c, v, , where u, and v 


(a) y, = 4Y, t bu, + yy b eve k k 
of unit variances and a, = —0.5, b, = 1.0, c, = 0.1. 


are independent sequences 


(b) Compute the least-squaresestimate of a, and b, from observation of 5, 50, and 500 
samples. Compare variance and bias to the theory in each case. 


(c) Compute the maximum-likelihood estimates of a,, b,, c,, and o2 from 5, 50, and 
500 samples, and compare the estimates to the known true values. 


12.10 Suppose we wish to identify a plant that is operating in a closed loop as 


Xue) = ayit bunt eg 
u, = -Kyy 
where e, is white noise. Show that we cannot identify a and b from observation of y and 
u, even if K is known. 
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Nonlinear Control 


Perspective on Nonlinear Control 


Every physical system is nonlinear for large signals, and many systems have 
important nonlinearities such as friction that cannot be ignored even for small 
signals. Furthermore, optimal control of many systems requires a nonlinear con- 
troller. Because the variety of nonlinear systems is so vast, it is only possible 
here to give an introduction to the most important issues and concepts.’ The first 
concern is stability, and the most important stability theory is that of Lyapunov. 
It is this theory that justifies the enormous attention paid to the linear case by 
providing the proof that, for most nonlinear systems, stability of the small signal 
linear approximation implies a region of stability of the nonlinear system. Be- 
yond this result, Lyapunov theory gives us a tool that can be used to determine 
further regions of stability and also can guide the design of purposely nonlinear 
controllers in a process called Lyapunov redesign. Unfortunately, application of 
Lyapunov theory is often difficult and can be very frustrating. Alternative ap- 
proaches, including computer simulation, heuristic methods such as the describ- 
ing function, and the frequency-response-based circle criterion are also important 
techniques. 

Design of controls for nonlinear systems may be placed in several categories. 
The most primitive method is to obtain a linear approximation, design a linear 
controller, and use computer simulation and perhaps a stability theory such as 
Lyapunov’s to explore nonlinear behavior. In an approach known as Lyapunoy 
redesign the engineer constructs a tentative Lyapunov function and deliberately 
designs the controller to force the system to realize the function which automat- 
ically provides a proof that the system is stable. Another widely used approach 
is to use feedback to reduce the equations to a linear form and continue as in 


| In this text we assume that the mode! equations have unique solutions and do not exhibit the astonishing 
behavior of Chaos (Gleick, 1987). 
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the first case. This method is known in the field of robotics as the method of 
computed torque. A combination of Lyapunov redesign and computed torque 
that extends both has recently been developed as backstepping. Another ap- 
proach to nonlinear design is to apply the concepts of optimal control to obtain a 
controller that minimizes a performance measure. A well-known case is that of 
minimal time control of a linear system subject to bounded controls. A practical 
approximation to minimal time control has been developed as proximate time 
optimal systems (PTOS). Finally, a widely used concept is that of adaptive con- 
trol. The fundamental idea of adaptive control is that the controller is redesigned 
on-line to maintain good performance in the face of a changing environment that 
includes major changes in the plant dynamics. Sub-categories of adaptive control 
are gain scheduling, the model-reference adaptive control (MRAC), and the 
self-tuning regulator (STR). 


Chapter Overview 


In this chapter, we first consider computer simulation and the heuristic analysis 
methods of equivalent gain and the describing function. For stability, the main 
analysis technique presented is the second method of Lyapunov, but also discussed 
are the circle criterion and the small gain theorem. An important design technique 
uses linear design methods based on a linear approximation to the plant model. 
As we will see, this approach is known as Lyapunov's first method. In Section 
13.2 we consider two approaches to design by nonlinear feedback. The first is 
the proximate time optimalsystem (PTOS) design for linearsystems having only 
saturation of the actuator as the nonlinearity, and the second is the self-tuning 
regulator approach to adaptive control. PTOS control is especially appropriate for 
motion control and adaptive control is widely used in chemical process control. 
In the last section of the chapter, several important approaches to computer-aided 
design of nonlinear systems are introduced. 


To obtain the remainder of this chapter, send a request to: 
elliskagle? gmail.com 


Design of a Disk Drive Servo: 
A Case Study 


Perspective on the Design Process 


To illustrate many of the concepts developed in this book, the design of a disk 
drive head positioning servomechanism provides an excellent case study. In most 
textbook examples, the problems are highly simplified for pedagogy: Too much 
detail often obscures the underlying point. Indeed, most of this text has followed 
this principle and many of the individual problems of this case study are used 
to illustrate various parts of the design process in earlier chapters. However, for 
a final design example we will not spare all of the detail but will present some 
of the difficult realities of a real problem. The problem requires constructing 
models of the plant, the actuator, and the sensor and verifying these models with 
simulations. Next comes determining the specifications both for the components 
and for the entire system. Following the problem formulation comes the design 
of a small-signal linear controller and finally the design is completed with a 
nonlinear controller for handling large signal behavior. The last step is testing the 
design in simulation for its performance in the presence of varying parameters 
before constructing a prototype and testing performance by experiment. We begin 
the case study by giving a description of the disk drive which will serve to set 
the stage for the specification development in our design, and this too will be in 
more depth than in our typical examples. 


Chapter Overview 


A general discussion of disk drives and current performance specifications are 
given in Section 14.1. In Section 14.2 the several components that comprise 
the disk drive servo are described and models given. The specific performance 
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specifications for the servo case study are given in Section 14.3. Finally, the 
details of the design are presented in Section 14.4. As with earlier chapters, this 
ends with a set of problems designed to enhance the learning process. 


To obtain the remainder of this chapter, send a request to: 
elliskagle@gmail.com 


Examples 


A.1 


In this appendix we describe five control problems which are used to illustrate 
the analysis and design techniques developed in the text. These are collected here 
in order to permit references to them at a number of places in the text without the 
need to repeat the derivation of the model or discuss the specifications each time 
the example is used. 


Single-Axis Satellite Attitude Control 


Satellites often require attitude control for proper orientation of antennas and 
sensors with respect to the earth. Figure A.1 shows a communications satellite 
with a three-axis attitude-control system. To gain insight into the three-axis 
problem we often consider one axis at a time. Figure A.2 depicts this case where 
motion is allowed only about an axis perpendicular to the page. The equations of 
motion of the system are given by 


16 =M, +M, (A.1) 


where / is the moment of inertia of the satellite about its mass center, M c is the 
control torque applied by the thrusters, M, are the disturbance torques, and 8 is 
the angle of the satellite axis with respect to an “inertial” reference. The inertial 
reference must have no angular acceleration. Normalizing, we define 


u = Mçc/l, w,=M,/I (A.2) 
and obtain 


Ö=u+ Wy (A.3) 
689 
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Figure A.1 
Communications 
satellite (courtesy Space 
Systems-Loral) 


Figure A.2 
Satellite-control 


schematic Inertial 
reference 


\/ 


Taking the Laplace transform 
(s) = EO +w,(s)], (A.4) 
which becomes, with no disturbance 
Ae b comi. (A.5) 


uls) s? 


A.2 


Figure A.3 

Satellite tracking 
antenna (courtesy Space 
Systems-Loral) 
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In the discrete case with u being applied through a zero-order hold, we can use 
the methods of Chapter 4 to obtain the discrete transfer function 


_ 6(z) 
G,(z) = Tm 
Fea 
E» md (A.6) 


A Servomechanism for Antenna 
Azimuth Control 


It is desired to control the elevation of an antenna designed to track a satellite 
as shown in Fig. A.3. The antenna and drive parts have a moment of inertia J 
and damping B arising to some extent from bearing and aerodynamic friction, 
but mostly from the back emf of the DC-drive motor. A schematic is shown in 
Fig. A.4. 

The equation of motion is 


J6+BO=T.+T,, (A.7) 
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Figure A.4 
Schematic diagram of 
antenna 


where T, is the net torque from the drive motor and T, is the disturbance torque 


due to wind. If we define 
B/J =a, u=T,/B, w, = T,/B, 
the equation reduces to 
fee +s 
-0 +0 =u+ w; 
a 
Transformed, it is 


(s) [u(s) + w,(s)] 


l 
~ s(s/a+1) 
or, with no disturbances, 
0 (s) A l 
u(s) ^ s(s/a +1) 
The discrete case with u(k) applied through a zero-order hold yields 
Ee s (z + b) 


= G,(s). 


G,(z) = -K——————, 
a u(z) (z — D(z — e^) 
where 
gc ca ies _ li estt—ahet! 


a : ~arelte 


(A.8) 


(A.9) 


(A.10) 


(A.11) 


Figure A.5 
General shape of the 


command angle and its 


first two derivatives, 
which the tracking 
antenna must follow 
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The model given by Eq. (A.7) is only a linear approximation to the true motion, 
of course. In reality there are many nonlinear effects that must be considered in 
the final design. To give some ideas of these considerations, we present the major 
features of the angle, 6,, of the satellite to be followed and of the servomechanism 
that must be designed to achieve the tracking. 

The general shape of 8, (t) and its velocity and acceleration are sketched in 
Fig. A.5. The peak values of velocity and acceleration depend critically on the 
altitude of the satellite above the earth and on the elevation above the horizon of 
the orbit seen from the antenna. Anorbit of 200 miles above the earth that passes 
at an elevation above the antenna of 86? requires a maximum azimuth rate of 
0.34 rad/sec (about 20°/sec). For the purpose of setting the tracking accuracy and 
gains, a reasonable assumption is that the antenna should be capable of following 
this peak velocity in the steady state with acceptable error. 

The size of the allowable tracking error is determined by the antenna proper- 
ties. The purpose, of course, is to permit acceptable communication signals to be 
received by the antenna electronics. For this, we must consider the dependence 
of signal amplitude on pointing error. A sketch of a typical pattern is shown 
in Fig. A.6. The beam width A0 is the range of tracking error permissible if 
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Figure A.6 


Plot of signal power 
received as a function of 


tracking error 


A.3 


DALAL 


A0 


acceptable communications are to be achieved. Typical values of the beam width 
can vary from a few degrees (0.1 rad) to less than one degree (0.01 rad). The 
total error will be composed mainly of tracking errors due to 0, motion, wind- 
gust errors, and random errors caused by noise in the measurement of satellite 
position. A reasonable allowance is to permit tracking errors of 10% of the beam 
width and an equal contribution from wind gusts. For a beam width of 0.1 rad, 
then, we allow a tracking error of 0.01 rad, from which it follows that K, = 1 (if 
6 ax = 0.01 rad/sec). For a 60-ft. tracking antenna, a typical beam width is 0.01 
rad, and if 0.34 rad/sec must be followed, the velocity constant for a 10% error 
(0.001 rad) is K, = 0.34/0.001 = 340. For a velocity constant of this magnitude, 
a conversion to a Type 2 system with two poles at z = | is probably required. 

In addition to considerations of tracking error, the designer must take into 
account the fact that the drive motor has finite torque and power capability. In 
selecting the drive motor, one must keep in mind that the total torque must be 
capable of meeting the acceleration demands of tracking, overcoming the wind 
torque, and overcoming the static friction of the drive-train gears and antenna 
mount. The power of the motor must be able to supply this torque at the maximum 
tracking velocity. 


Temperature Control of Fluid in a Tank 


The temperature of a tank of fluid with a constant flow rate in and out is to be 
controlled by adjusting the temperature of the incoming fluid. The temperature 
of the incoming fluid is controlled by a mixing valve that adjusts the relative 
amounts of hot and cold supplies of the fluid (Fig. A.7). Because of the distance 
between the valve and the point of discharge into the tank, there is a time delay 
between the application of a change in the mixing valve and the discharge of the 
flow with the changed temperature into the tank. 
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Figure A.7 
Tank temperature control 


Negligible heat loss in walls 


Tank fluid at 
temperature 7, 


The differential equation governing the tank temperature is 
: 1 
T = — (q, — ; A.12 
e cM (in Tiga) ( ) 


where 


T, = tank temperature, 
c = specific heat of the fluid, 
M = fluid mass contained in the tank, 


Gin = ema T, 
Jou = cm, Ts 

m = mass flow rate (= m, = Mu) and 
T, = temperature of fluid entering tank; 


but the temperature at the input to the tank at time ¢ is the control temperature r, 
sec in the past, which can be expressed as 


T.) = T, (t — x), (A.13) 
where 
t= delay time, and 
T. = temperature of fluid immediately after the control valve and directly 


controllable by the valve. 
Combining constants, we obtain 


f. (t) + aT, (t) = aT, (t — T), (A.14) 
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where 
a=m/M, 


which, transformed, becomes 


ES eee A.15 
TE) Jakk D (atem 


ec 


To form the discrete transfer function of G, preceded by a zero-order hold, 
we must compute 


-Ts 


l—e e 
G -E,———————i. 
ae) | s s/a+! | 


We assume that t, = IT — mT, 0 < m < 1. Then 


jns e ^5 eg IsgnTs 
G zu ———— 
3(2) | s  sfat+l | 
mTs 
= (l— = 17 nd c E 
eae | +1) 
mTs mTs 
-(ü-z'z [: == | 
S s+a 
=| 


1 m 
= = gait + mT) —e "1 +mT)} 


E z—1 1 z ently 
z zdz-l1 z-e*" 


1 (d = ery: 4+ eon rat 
“faa 
i e "T z+a 
e anT ie erat 
a= loe (A.16) 


From Eq. (A.16) it is easy to see that the zero location, —o, varies from œ = oo 
at m — 0 to æ = 0 as m — | and that G,(1) = 1.0 for all a, m, and /. 

For the specific values of t,=1.5, T = l, a = 1, Eq. (A.16) reduces to 
ipo. m= 4) 


z + 0.6065 


(GE nee 
i = 6 — 0.3679) 


A.4 


Figure A.8 
Double-mass spring 
system 
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Control Through a Flexible Structure 


Many controlled systems have some structural flexibility in some portion of the 
system. The spacecraft of Fig. A.1 may not be perfectly rigid, the angle tracker 
of Fig. A.3 may have some flexibility between the angle observed by the antenna 
and the angle of the base, and almost any mechanical system such as a robot arm 
or adisk drive read/write head assembly would exhibit some degree of structural 
flexibility. 

Conceptually, these systems are equivalent to the double mass—spring device 
shown in Fig. A.8. The equations of motion are 


My+(y—d)b+(y—d)k =u, 
md + (d — ý)b + (d — y)k = 8, (A.17) 


which, when transformed, become, 


Ms? +bs+k —(bs +k) y(s) l 

l —(bs +k) ms? + bs +k | E: l T E IE (A18) 
Two transfer functions are of interest: one between u and d, the other between 
u and y. The first represents the case where there is structural flexibility be- 
tween the sensor and the actuator, a situation called the noncollocated case. The 
second represents the collocated case where the sensor is placed on the same 
rigid body as the actuator, but there exists a mechanical oscillation elsewhere 
in the system that is coupled to the mass to which the actuator and sensor are 
attached. 

Using Eq. (A.18) we can obtain both transfer functions 


(8) 
d(s) = 1 m m (A.19) 


u() M ol» my [b kN? 
sfs + (14%) (žE) 


m, M = masses 
b = damping coefficient 
k = spring constant 
F - applied control force 
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Figure A.9 

Root locus for double 
mass-spring system with 
flexibility between 
sensor and actuator. The 
noncollocated case 


which becomes for the typical case with very low damping 


d(s) = 4. k/m _ 
as) = epee Ju à ure G,(s) (A.20) 
S |S + (1+) (Lez 
and 
z b k 
) l S PUT iren 
WU, es Cm nere c Uu) (A.21) 


4 Malin my. t 
S [s +(1+5) (2: 2)] 


For m/ M < 1, Eq. (A.21) indicates that the system dynamics are essentially 
1/s?, but there are poles and zeros at the structural bending-mode frequency 
that almost cancel each other. A small value of m/M in this simplified example 
represents the fact that the flexibility is not a dominant response to the control 
input u and 1/s? is a good model for many such problems. 

Equation (A.20) always has the 1/5? (“rigid-body”) poles plus the resonance 
poles without the neighboring zeros of Eq. (A.21). The noncollocated caseis the 
harder one to control for if a lead compensation is used to stabilize Eq. (A.20), 
the structural mode is destabilized, as shown in the root locus of Fig. A.9, and 
depending on the feedback gain and original structural damping, the structural 
mode could well be driven unstable. 


€ Closed loop 
root locations 
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A.5 Control of a Pressurized Flow Box 


This example is multivariable because it has two control variables and two mea- 
surable states. The basic function of the system is to keep the paper-stock flow 
out the bottom opening (slice) (Fig. A.10) at a consistent rate. The differential 
equations of motion are ' 


H 03 SNL +1 H 0 1 3 
h |e| ws o0 0 h |+| D 07 | |. 
ù 0 0 -1 u, 1 0 s 


H = h + p/p,g = total head perturbation, 
p = air pressure, 

p = mean air pressure, 

h = stock-level perturbation, 


u, = perturbation in air-valve opening, 
u, = command value to air valve, and 
u = perturbation in stock-valve opening. 


Note from Eq. (A.22) that there is two-way coupling between the two outputs, 
H and h, and that u, affects both states; therefore, it is difficult to decouple the 
system so as to influence either H or h alone with a single control. The third 


Figure A.10 
Pressurized flow box 


Stock is deposited on 
endless belt or “wire” 


Wet paper going 
to drying stage 


| These numbers are mainly contrived and might not correspond to any real head box, alive or dead. 
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equation in Eq. (A.22) represents dynamics of the air-valve actuator, which is 
such that u, does not respond instantaneously to an air command u. The transfer 
functions between u, and u, and H and h are 


l fs (s + 1)(s + 0.07) 
A( | -0.05 0.7(s + Ds --0.13) 


where A(s) = (s + 1)(s + 0.1707) (s + 0.02929). 


Gs) = 


B 


Tables 


B.1 Properties of z-Transforms 


Let F(s) be the Laplace transform of f, (t) and F,(z) be the z-transform of f (kT) 


Table B.1 
Number Laplace Transform Samples z-Transform Comment 
= F(s) F&T) F(z) 
] aJ. (s) + BF,(s) af (KT)+ BS, (kT) a F (z)-B F(z) The z-transform is linear 
2 F fe (s) 2v. FUTIS AT ET) FONR Discrete convolution cor- 
EE responds to product of 
z-transforms 
et"T! F(s) f(kT nT) 2" F(z) Shift in time 
4 F(s +a) era RG) F(e*7z) Shift in frequency 
5 —- lim f(kT) lim(z — 1) F(z) If all poles of (z — 1) F(z) 
k—oo 2] eee d 
are inside the unit circle and 
F(z) converges for 1 < |z} 
6 F(s/w,) f(w,kT) F(z; w,T) Time and frequency scaling 
1 d 
7 — SAT) AKT) a $ F (6) FOE Time product 
7 ve; 
8 Fs) = FOF) Jf ORE — r)dt Continuous convolution 


F; (2) 


does not correspond to 
product of z-transforms 
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B.2 Table of z-Transforms 


F(s) is the Laplace transform of f(t) and F(z) is the z-transform of f (kT). 
Unless otherwise noted, f(t) = 0, £ < O and the region of convergence of F(z) 
is outside a circle r < |z| such that all poles of F(z) are inside r. 


Table B.2 
Number F(s) S(kT) F(z) 
l = 1, k =0; 0, k #0 l 
2 = l, k=m; 0, kžm ore 
3 1 LT) —— 
5 z-i 
l T2 
4 = kT — 
Ü ( - 10 
l ME T! zz + 1) 
Se —(kTY. a 
3 s? 2D EGI 
l 1 T? zz +42 +1) 
6 = = (kT) a a 
E aa E (uj 
; 1 v Elo es p Eom a 
s" a0 (m — 1)! 9a"! a0 (m — 1)! da"! z — eT 
l z 
8 -akT 
s+a : toph 
l Tze *? 
9 ——À kTe "T ——__; 
(s +a)? ELEC 
l 1 io Té z(z e") 
10 — z (kTYe-? —e IL—M—— 
(s - ay 2 ( 2 (z—e Ty 
EN m-l m-1 | m~! gn"! 
" 1 (-1) a (eT) ROS £ ; 
(s +a)” (m — 1)! da"! (m = 1)! 3a" z= e°? 
12 : 1 — eat Ss 


s(s +a) =De eT) 


Table B.2 


Number 


13 


20 


21 
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JKT) 


F(s) 

O 

s? (s +a) 
b-a 


(s +a)(s +b) 
" 
(s +a) 
a? 
s(s +a)? 
(b —a)s 
(s+ ay(s +b) 
a 
+a? 
Ss 
s +a? 
sta 
(s +a)? +b? 
b 
a? +b? 


s((s - ay + b?) 


l 

-(akT — 1 + e **T) 

a 

(e"**T _ pbk 

(1 — akT )e~*T 

1 — e7**" (1 + akT) 
eg PKT = ae ^T 

sin akT 

cos akT 


e **T cos bkT 


e™™T sin bkT 


| eT (sos DRT + sin BRT 


b 


F(z) 


) 


zllaT — 1+ e*")z + (1 — eT — aTe-7 y] 
a(z — 1)*(z — e ?T) 
(eT E e T )z 

(z = e727" )(z —e*7) 
z[z — e?" (1- aT)] 

(z E exin) 
z[z(1 — e?" — aTe™*") +e7%T — eT + aTe] 

(z — 1)(z — ey 

zlz(b — a) — (be *" —ae™")) 

(eT); — e) 

z sin aT 
z? — (2 cos aT)z+1 
z(z — cos aT) 

z? — (2 cos aT)z + d 

z(z — e?! cos bT) 
z? — 2e?" (cos bT)z + e722" 

ze?! sin bT 
z? — 2e^*" (cos bT)z + e?" 
z(Az + B) 

(z — 1)? — 2e"! (cos bT)z + e ?"") 


A-2I—e"* cos bT — ze% sin bT 


Bae 4 ; e7*T sin bT — e?" cos bT 


C: 


A Few Results from Matrix Analysis 


C.1 


Although we assume the reader has some acquaintance with linear equations 
and determinants, there are a few results of a more advanced character that 
even elementary control-system theory requires, and these are collected here for 
reference in the text. For further study, a good choice is Strang (1976). 


Determinants and the Matrix Inverse 


The determinant of a product of two square matrices is the product of their 
determinants 


det(AB) = det A det B. (C.1) 


If a matrix is diagonal, then the determinant is the product of the elements on the 
diagonal. 

If the matrix is partitioned with square elements on the main diagonal, then 
an extension of this result applies, namely, 


det l 5 H | — det A det C, if A and C are square submatrices. (C.2) 


Suppose A is a matrix of dimensions m x n and B is of dimension n x m. 
Let L, and I, be the identity matrices of size m x m and n x n, respectively. Then 


det [I, + BA] = det[I, + AB]. (C.3) 


To show this result, we consider the determinant of the matrix product 


I, 0 I Ap |i. I A B 
is | B I, I -B I, |= aer E I, + BA | = det [I, + BA]. 
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But this is also equal to 


I, —A L A = E, T AB 0 — 
aer |% I IE I ]-«[ _B I ] eet, «am. 


n n n 


and therefore these two determinants are equal to each other, which is Eq. (C.3). 
If the determinant of a matrix A is not zero, then we can define a related 
matrix A~', called “A inverse,” which has the property that 


AA" = A'A =I. (C.4) 
According to property Eq. (C.1) we have 
det AA"! = det A- det A^! = 1, 
Or 


det A^! = - 
det A 


It can be shown that there is an n x n matrix called the adjugate of A with 
elements composed of sums of products of the elements of A' and having the 
property that 


A -adj A = det A . I. (C.5) 
Thus, if the determinant of À is not zero, the inverse of A is given by 
-,  adjA 
~ det A` 


A famous and useful formula for the inverse of a combination of matrices has 
come to be called the matrix inversion lemma in the control literature. It arises 
in the development of recursive algorithms for estimation, as found in Chapter 9. 
The formula is as follows: If det A, det C, and det (A + BCD) are different from 
zero, then we have the matrix inversion lemma: 


(A+ BCD)"! = A! — A^B(C^! + DA™'B)“'DAT!. (C.6) 
The truth of Eq. (C.6) is readily confirmed if we multiply both sides by A + BCD 
to obtain 
I = I + BCDA™ — B(C' + DA“'B)'DA™! 
—BCDA !B(C' + DA™'B)"'DA™! 
= I + BCDA`™' - [B+ BCDA ' BI[C^' + DA™'B)'DA™. 


If we subtract I from both sides and factor BC from the left on the third term, we 
find 


0 = BCDA"' — BC[C'' + DA! B]JIC^' + DAB] "'DA™, 


1 If A isthe n — 1 x n — 1 matrix (minor) found by deleting row i and column j from A, then the entry 
in row i and column j of the adj A is (—1)'*/ det A”. 
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which is 


0=0, which was to be demonstrated. 


Eigenvalues and Eigenvectors 


We consider the discrete dynamic system 


(C.7) 


3: al 
es| Wb 26 
o- [13] ay 
If we assume that it is possible for this system to have a motion given by a 


geometric series such as z*, we can assume that there is a vector v so that X, can 
be written 


X; =z", (C.9) 


Substituting Eq. (C.9) into Eq. (C.7), we must find the vector v and the number 
z such that 


v2 = dy;*, 
or, multiplying by z~“ yields 
vz = Èv. (C.10) 
If we collect both the terms of Eq. (C.10) on the left, we find 
(zI — d)v = 0. (C.11) 


These linear equations have a solution for a nontrivial v if and only if the de- 
terminant of the coefficient matrix is zero. This determinant is a polynomial of 
degree n in z (® is an n x n matrix) called the characteristic polynomial of 
6b, and values of z for which the characteristic polynomial is zero are roots of 
the characteristic equation and are called eigenvalues of ®. For example, for the 
matrix given in Eq. (C.8) the characteristic polynomial is 


eiii s 


Adding the two matrices, we find 


A 1 
a| 7 6 Yh 
—1 Z 
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Thus the characteristic roots of this ® are } and i Associated with these charac- 


teristic roots are solutions to Eq. (C.11) for vectors v, called the characteristic 
or eigenvectors. If we let z — i, then Eq. (C.11) requires 


stile. c 


Adding the matrices, we find that these equations become 


EHIFIEHI cw 


Equations (C.14) are satisfied by any v,, and v,, such that 


Nt Ale 


Vj = 2v, 


from which we conclude that the eigenvector corresponding to z, — 1/2 is given 
by 


a 
EL | 2a Jt (C.15) 
We can arbitrarily select the scale factor a in Eq. (C.15). Some prefer to make 


the length? of eigenvectors equal to one. Here we make the largest component of 
v have unit magnitude. Thus the scaled v, is 


l 
“y= | 2 |. (C.16) 


In similar fashion, the eigenvector v, associated with z, = 1/3 can be computed 


to be 
1 
w= (2 | 


Note that even if all elements of ® are real, it is possible for characteristic values 
and characteristic vectors to be complex. 


2 Usually we define the length of a vector as the square root of the sum of squares of its components or, 
if ||v|| is the symbol for length, then [|v]|? = v7 v. If v is complex, as will happen if z; is complex, then 
we must take a conjugate, and we define ||vi|? = (v*)7 v, where v* is the complex conjugate of v. 
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Similarity Transformations 


If we make a change of variables in Eq. (C.7) according to x = Tx, where T is 
an n x n matrix, then we start with the equations 


XQ = Px, 
and, substituting for x, we have 


Tx,,, = PTx,. 


kl 
Then, if we multiply on the left by T~', we get the equation in x, 
x, = T Tx. (TT) 
If we define the new system matrix as W, then the new states satisfy the equations 
Xs Wx,, 
where 
Y =T'OT. (C.18) 
If we now seek the characteristic polynomial of W, we find 
det [zI — W] = det [zI - T! T]. 
Because T^'T = I, we can write this polynomial as 
det[zT ' T — T"'®T], 
and the T^! and T can be factored out on the left and right to give 
det [T~'[zI — ®]T]. 
Now, using property Eq. (C.1) for the determinant, we compute 
det T^! - det [zI — ®] - det T, 
which, by the equation following Eq. (C.4), gives us the final result 
det [zI — W] = det [zI — 45]. (C.19) 


From Eq. (C.19) we see that W and have the same characteristic polyno- 
mials. The matrices are said to be "similar," and the transformation Eq. (C.18) is 
a similarity transformation. 

A case of a similarity transformation of particular interest is one for which 
the resulting matrix W is diagonal. As an attempt to find such a matrix, suppose 
we assume that W is diagonal and write the transformation T in terms of its 
columns, t,. Then Eq. (C.18) can be expressed as 


TW = 9T, 
[t,t,...t,] Y = Olt, t,...t,] 
= [Ðt Ot, ... t ]. (C.20) 
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If we assume that W is diagonal with elements A,,A,,...,4,, then Eq. (C.20) 
can be written as 
A, 0 0 
0 A, 0 
0 
[66 €] =[t, Pt, ... bt, ]. 
i, 
Multiplying the matrices on the left, we find 
[A tA, ---A,t,] = [Pt --- t ]. (G21) 


Because the two sides of Eq. (C.21) are equal, they must match up column by 
column, and we can write the equation for column j as 


At = dt. (C.22) 
15 j 


Comparing Eq. (C.22) with Eq. (C.10), we see that t, is an eigenvector of 
® and A, is an eigenvalue. We conclude that if the transformation T converts 
® into a diagonal matrix W, then the columns of T must be eigenvectors of ® 
and the diagonal elements of W are the eigenvalues of ® [which are also the 
eigenvalues of W, by Eq. (C.19)]. It tums out that if the eigenvalues of ® are 
distinct, then there are exactly n eigenvectors and they are independent; that is, 
we can construct a nonsingular transformation T from the n eigenvectors. 
In the example given above, we would have 


for which 


and the new diagonal system matrix is 


ww ss 
SET 
ali 


as advertised! If the elements of ® are real and an eigenvalue is complex, say 
A, — a + jf, then the conjugate, Aj = a — jf, is also an eigenvalue because 
the characteristic polynomial has real coefficients. In such a case, the respective 
eigenvectors will be conjugate. If v, = r + ji, then v, =v; =r — ji, where r 


p Wie 
LLL 


— ni= AI 
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and i are matrices of real elements representing the real and imaginary parts of the 
eigenvectors. In such cases, it is common practice to use the real matrices r and 
—ias columns of the transformation matrix T rather than go through the complex 
arithmetic required to deal directly with v, and vj. The resulting transformed 
equations are not diagonal, but rather the corresponding variables appear in the 
coupled equations 


n=an—Bv, v=Bn+av. (C.23) 


The Cayley-Hamilton Theorem 


A very useful property of a matrix ® follows from consideration of the inverse 
of zI — ®. As we saw in Eq. (C.5), we can write 


(zI — ®) adj (zI — 4) = I det (zI — 6). (C.24) 
The coefficient of I on the right-hand side of Eq. (C.24) is the characteristic 
polynomial of ®, which we can write as 

a(z) =z" +a +a, +---+4,. 
The adjugate of zI — ®, on the other hand, is a matrix of polynomials in z, found 
from the determinants of the minors of zI — ®. If we collect the constant matrix 
coefficients of the powers of z, itis clear that we can write 

adj (zI — 6) = B,z""'+B,z"?4+---+B,, 
and Eq. (C.24) becomes a polynomial equation with matrix coefficients. Written 
out, it is 
[zI — PJ[B 2"! + Biz"? +--+ B.] -zIc az +- +a I (C25) 


If we now multiply the two matrices on the left and equate coefficients of equal 
powers of z, we find 


B, — I, 
B, = B, +a I= ^al], 
B, = OB, c a,I = $ t a, + a,l, 


B, = 6B, , «a, I= Qa a"? dese al 

0— B, +a l= D" ca," ca"! c--- cal — (C26) 
Equation (C.26) is a statement that the matrix obtained when matrix ® is substi- 
tuted for z in the characteristic polynomial, a (z), is exactly zero! In other words, 
we have the Cayley-Hamilton theorem according to which 


a(®) = 0. (C.27) 


ee eet I 


D 


Summary of Facts from the 
Theory of Probability and 


Stochastic Processes 


D.1 


Random Variables 


We begin with a space of experiments, €), whose outcomes are called w and 
depend on chance. Over the space Q and its subsets Q, we define a probability 
function P,' which assigns a positive number between 0 and 1 to each countable 
combination of subsets in Q to which an outcome (or event") can belong. P has 
the properties that the probability of some outcome is certain, in which case it is 
assigned the value 1 


PQ) = 1, 


and the probability of an outcome that may result from events 0,0, which have 
no common points (the intersection of Q, and Q, is empty) is the sum of the 
probabilities of Q, and Q, 


PIN, UA} = P(Q) + PO). 


In addition to the function P we define a random variable x(w), which 
maps €) into the real line such that to each outcome w in Q we associate a value 
x, and the probability that a chance experiment maps into a value x that is less 
than or equal to the constant a is? 


Pr(x € a) = F (a). (D.1) 


1 See Parzen (1962). 
2 Pr (-) is meant to be read “the probability that {-}.” 
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Figure D.1 
Sketch of the uniform 
density function 


Figure D.2 
Sketch of the normal 
density function 


The function F, (4) is called the distribution function of the random variable. If 
F, is a smooth function? we define its derivative f,(€) as the density function, 
which has the property 


b 
Prasx<b)= | fO. (D2) 


Because the whole space Q maps into the line somewhere, we have 


it f (dt = 1. (D3) 


Two common density functions that we shall have reason to use are the 
uniform density and the normal or Gaussian density. A random variable having 
a uniform density has zero probability of having any value outside a finite range 
between lower limit £ and upper limit u, £ < x < u, and f, is constant inside this 
range. Because of Eq. (D.3), the constant is 1/(u — 2). A sketch of the uniform 
density is given in Fig. D.1. 


-4 -3 -2 -l 0 1 2 3 4 -um 


3 Using the impulse, we can include simple discontinuities, as must be done if a specific value of x has 
nonzero probability. 
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The normal density function is given by the equation 


l 1 -uy 
maa DA 


and shown in the sketch in Fig. D.2. 
The importance of the normal density derives mainly from the following 
facts: 


S) = 


1. The distribution of a random variable based on events which themselves con- 
sist of a sum of a large number of independent* random events is accurately 
approximated by the normal law. Such a distribution describes electrical noise 
caused by thermal motions of a large number of particles as in a resistor, for 
example. 

2. If two random variables have (jointly) normal distributions, then their sum 
also has a normal distribution. (As an extension of this second point, if the 
input to a linear system is normal, then the distribution of its output is also 
normal.) 


D.2 Expectation 


By the very nature of variables whose values are dependent on chance, we cannot 
discuss a formula for the calculation of values of x. To describe arandom variable, 
we instead discuss average values such as the arithmetic mean or the average 
power. Such concepts are contained in the idea of expectation. The expected 
value of a function g of a random variable whose density is f, is defined as 


£26) = / 20 f. dE 
= g(x). (D.5) 


Important special cases are the mean, variance, and mean square. If g(x) = x, 
then we have the mean, namely 


ae j &f (dt =F =n, (D.6) 


For the uniform distribution, the mean is given by 


r= [eren 


K u l z 1 E? 
=| | |= u—t B 


4 The technical definition of "independent" will be given shortly. 


l 
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1 u^ od u+é 
= cea SN : D.7 
ZIP ;| 2 Dep 


Because the probability density function has the intuitive properties of ahistogram 
of relative frequency of occurrence of a particular x, we see that the mean, like 
an arithmetic mean, is a weighted average of the random variable values. 

If g(x) in Eq. (D.5) is g = (x — x)’, then the expected value is the average 
of the square of the variation of the variable from the mean; this number is called 
the variance of x, written var x. For the uniform density we compute 


E(x —xy-varx 


= i (E — xy f, dt 
]€-9 € 
t u-t 
l (€ — x) |" 
“tl 3 ii 
l (u—x)  (£—xy 
earl Se i 


Substituting Eq. (D.7) for x, we find 


dias zo u—t\* (555) 
ae ssl 2 )- 2 : 


and, simplifying, we get 


E 
var x = C = . (D.8) 


The square root of the variance is called the standard deviation and given the 
symbol c. Thus, for the uniform density 
2 (0-0? 
= " D.9 
n 12 Pee) 
If we let g(x) be simply x^, then we compute by Eq. (D.5) the mean-square 
value x?. However, consider an expanded expression for the variance, and let the 
mean value of x be u 


deis Í (E — u^ f. (48 


z / E? — £u + p f, (dE 


=x*— 2p? +p? 
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Thus we find 
v= var x + B 
For the uniform density, the mean-square value is, therefore 
x s (u — £y n (y 
12 2 ` 


The integration is more complicated in the case of the normal density, but the 
definitions are the same; namely, if x has the normal density given by Eq. (D.4), 
then 


z= f LOdE =u, (D.10a) 
and 


£e -uy- [ E- ud 
var x = o2. (D.10b) 


Note that the mean and standard deviation already appear as parameters of the 
density given by Eq. (D.4). Because these two parameters completely describe 
the normal density, it is standard to say that a random variable with normal density 
having mean 4 and standard deviation ø is distributed according to the N(u, o) 
law. 


More Than One Random Variable 


Frequently the random experiment has an outcome that is mapped into several 
random variables, say x,, x,, ..., x, or, organized as a column matrix 


x = (xx, .. UE 
For this collection we define the probability that 
X. Sg Sasa k d 
as 
Pr{x < a} 


and let this be the distribution F, (£) with vector argument. The corresponding 
density f (Ẹ) is a function such that the probability that x is in a box with sides 
«, and b, is given by 


b, b, 
Pra <x <b) = f «f fes s EE dE, (D.11) 
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The mean of the vector is a vector of the means 
EX} = By = (A, b, He) (D.12) 


and likewise the deviation of the random variables from the mean-value vector is 
measured by a matrix of terms called the covariance matrix, cov x, defined as 


cov x = E{(x—p,)(x— p,)7} = Ry. (D.13) 
Thus cov x is a matrix with the element in row i and column j given by 
ELX; — n); — n). 


Often the symbol R, will be used for the covariance matrix of a random vector 
x. In an obvious way we can extend Eq. (D.13) to include the case of different 
random vectors x and y with mean vectors p, and p... We define the covariance 
of x and y as : 


cov(x, y) = £((x — (y — p,)"} = Ry- (D.14) 


Consider now the special case of two random variables x and y having distribution 
F, (6, n) and (joint) density f, ,(5, n). Because the probability that y has any 
value is the integral from —oo to +00 over 7, the density of x is given by 


LO= | F€ ma. (D.15) 
If f, , is such that 
f.y€ = f, OFM, (D.16) 


we say that x and y are independent. 

It often happens that we know a particular value of one random variable, 
such as the output y of a dynamic system, and we wish to estimate the value of 
a related variable, such as the state x. A useful function for this situation is the 
conditional density, defined as the density of x given that the value of y is n and 
is defined as 


f. m 
= ——-. Di 
fa Elmo eS (D.17) 
Note that, from Eqs. (D.15) and (D.16), if x and y are independent, then 
f,,,61m = f,G). (D.18) 


The most important multivariable probability density is the normal or Gaussian 
law given by 


Ji 
FE) = [(27)" |R, |"? exp [-56- BR; 6-5). (D.19) 


where | R, | is the determinant of the matrix R,. 
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For the multivariable normal law, we can compute the mean 
E{x} = po 
and the covariance matrix 
E{(x — p(x — w,)"} =R,. 


Like the scalar normal density, the multivariable law is described entirely by 
the two parameters p and R, the difference being that the multivariable case is 
described by matrix parameters rather than scalar parameters. In Eq. (D.19) we 
require the inverse of R, and have thus implicitly assumed that this covariance 
matrix is nonsingular. [See Parzen (1962) for a discussion of the case when R, is 
singular. ] 


Stochastic Processes 


In a study of dynamic systems, it is natural to have random variables that evolve 
in time much as the states and control inputs evolve. However, with random 
time variables it is not possible to compute z-transforms in the usual way; and 
furthermore, because specific values of the variables have little value, we need 
formulas to describe how the means and covariances evolve in time. A random 
variable that evolves in time is called a stochastic process, and here we consider 
only discrete time. 

Suppose we deal first with a stochastic process w(k), where w is a scalar 
distributed according to the density f (N). Note that the density function depends 
on the time of occurrence of the random variable. If a variable has statistical 
properties (such as f,) that are independent of the origin of time, then we say 
the process is stationary. Considering values of the process at distinct times, 
we have separate random variables, and we define the covariance of the process 
w as 


R G, k) = E(w) — WJ) (wk) — Wk). (D.20) 


If the process is stationary, then the covariance in Eq. (D.20) depends only on 
the magnitude of the difference in observation times, k — j, and we often will 
write R, ( j,k) = R,(k — j) and drop the second argument. Because a stochastic 
process is both random and time dependent, we can imagine averages that are 
computed over the time variable as well as the averages defined in Section D.2. 
For example, for a stationary process w(n) we can define the mean as 


N 


Y we +h), (D.21) 


nz-N 


= li 
nas ON E 
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and the second-order mean or autocorrelation 


E{(w(j) — &)(w(k) — w(k))} = 


1 N 


lim 
Noo 2N +|] 


n=- 


(Qo(n + j) — W(j))Qw(n +k) — 8()) (D.22) 
N 


For a stationary process, the time average in Eq. (D.21) is usually equal to the 
distribution average, and likewise the second-order average in Eq. (D.22) is the 
same as the covariance in Eq. (D.20). Processes for which time averages give the 
same limits as distribution averages are called ergodic. 

A very useful aid to understanding the properties of stationary stochastic 
processes is found by considering the response of a linear stationary system to 
a stationary input process. Suppose we let the input be w, a stationary scalar 
process with zero mean and covariance R„(j), and suppose we take the output to 
be y(k). We let the unit-pulse response from w to y be A(j). Thus from standard 
analysis (see Chapter 4), we have 


yi) = Y nd)wG - 0, (D.23) 


kz-—oo 


and the covariance of y(j) with y(j + £) is 
RO = Ey + Oy) 


=E} hkwj+e zi | Y hewG -m . (D24) 


=-00 n--—oc 


Because the system unit-pulse response, A (k), is not random, both A(k) and h(n) 
can be removed from the integral implied by the € operation, with the result 


RO = X hk) Y h Elw + £— bw — n). (D.25) 


kz-oo n--oo 


The expectation in Eq. (D.25) is now recognized as R,(£ — k + n), and substi- 
tuting this expression in Eq. (D.25), we find 


R(= D> Atk) Y ho) RG — kan). (D.26) 


k=-00 n--oo 


Equation (D.26) is not especially enlightening, but the z-transform of it is. We 
proceed with several simple steps as follows 


2{R,(2)} = 3 RO 


oo eo oo 


= 2 Yo Atk) SO AMR, (EC —k nz. 


£——00 k=—00 n--oo 
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Exchanging the order, because 4 (k) and h(n) do not depend on £, we have 


2{R,(2)} = x h(k) y h(n) Y R, (E — k - n)z *. 


=-00 n=-00 t=- 


Now we let m = £ — k + n in the last sum, leading to 


z{R,(2)} = Y h(k) 5s h(n) y Ri (nj m 


kz:—oo n--oo m=-00 


-—(m+k—n) -m „=k n 


Finally we use the fact that z = z ™z~*z" and distribute these terms to the 


corresponding sums with the result 


ARO = Yo he D> hz" 0. RO” 


kz-oo n=—00 m=-—00 


=e). (D.27) 


For reasons soon to be clear, we call the z-transform of R, the spectrum of y 
and use the symbol S,(z), and similarly for w and S,,. With these symbols and 
recognition that the z-transform of the unit-pulse response is the system-transfer 
function, H (z), Eq. (D.27) becomes 


S,(z) = H(z) H(z )S, (2). (D.28) 


To give an interpretation of Eq. (D.28) we make two observations. First note 
that R, (0) =E (y?) is the mean-square value or power in the y-process. By the 
inverse transform integral, we have 


y! = R,(0) 
= fs iet 
COAT PST 
= - frons, o. (D.29) 


T nj 


Now, as a second step, we suppose that H(z) is the transfer function of a very 
narrow bandpass filter centered at w, so that H(z)H(z7') is | H(e/^v") |? and is 
nearly zero except at w. Then the integral in Eq. (D.29) can be approximated 
by assuming that 5, (z) is nearly constant at the value S (eE), where | H |? is 
nonzero and can thus be removed from the integral. The result is 


= . l d 
y = S, (e!) — jenen 
2zj z 
= S, (e/^»^) K. (D.30) 


In Eq. (D.30) we have defined the integral as a constant dependent on the exact 
area of the narrow-band characteristic of H (z). But now wecan give good intuitive 
meaning to Eq. (D.30). The mean square of the output of a very-narrow-band filter 
is proportional to the S, . If S,, is constant for all z, we say the process is white 
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(after the spectrum of white light, which has equal intensity at all frequencies). 
Hence we call S, the power spectral density of the w-process. Equation (D.28) 
is the fundamental formula for transform analysis of linear constant systems with 
stochastic inputs. 

An alternative to transform analysis and Eq. (D.28) is transient analysis via 
the state-variable formulation, described in Section 4.3.3. In this case, we take 
the system equations to be 


x(k + 1) = x(k) + Tw), 


y(k) = Hx(k). (D.31) 
We assume that the system starts at k = 0 with the initial value 
E{x(0)} = 0, 
E{x(0)x7(0)} = R,, (0; 0) 
= P,(0), 


and that w(k) is a stationary process with covariance 
E{w(k)w’ (k + j)} 20 (G #0) 
zR. (p. (D.32) 
Note that by Eq. (D.27), S, (z) = R, a constant, and hence w is a white process. 


With these conditions, we can compute the evolution of E. the autocovariance 
of the state at time k. Thus 


E{x(k)x7 (k)} = R (K; k) 
= P,(k), 
and 
£(x(k + Dx! (k + 1)} =P (k + 1) 

= E{(Ox(k) +T w(k))(®x(k) + Py w(k))7} 

= DE (x(k)x" (k)) b" + £(x(k)w"(k)TT 

+ T,£(w(ox! (0) + T,£(iw)w'(X)T. (D.33) 
Because the center two terms in Eq. (D.33) are zero by Eq. (D.32)? we reduce 
Eq. (D.33) to 

P (k + 1) = 6P,(009 +T, R Ti, 
P (0) = a given matrix initial condition. (D.34) 

Equation (D.34) is the fundamental equation for the time-domain analysis of 


discrete systems with stochastic inputs. Note that Eq. (D.34) represents a non- 
stationary situation because the covariance of x(k) depends on the time k of 


5 x(k) is a combination of w(0), w(1)..... w(k — 1), all of which are uncorrelated with w(k). 
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the occurrence of x. However, if all the characteristic roots of ® are inside the 
unit circle, then the effects of the initial condition, P,(0), gradually diminish, 
and P, approaches a stationary value. This value is given by the solution to the 
(Lyapunov) equation 


P, =P +r RC- (D.35) 
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MATLAB Functions 


Table E.1 Mataas functions and pages where used 
Contro! Digital 
Student System Control 
Function Edition MarLAB Toolbox Toolbox 
(. m file) v5 v5.0 v4.0 3e Page(s) 
acker x x 43,46, 286,295,297 
bode x x 33,38,134 
c2d x x 99,107,116,281,288 
connect x 219 
conv x x 28,63 
d2c x x 470 
damp x x 45,64 
disrw x 397,455 
dlqr x x 377,409 
dlyap x x 435,455 
eig x x 45,109 
feedback x x 29,62,288 
impulse x x 18,131 
initial x 288 
dequiv x 380 
liman x x 47,395,415 
loglog x x 134 
lar x x 44 
Iqrd x x 380 
Isim x x 15,49,326 
lyap x x 455 
margin x x 64 
nyquist x x 242 
place x x 43,46,286,295 
plot x x 18 
qss x 431 
qwc x 429 
refi x 313 
ripple x 182,231 
rlocfind x x 29 
rlocus x x 29,216 
roots x x 45 
semilogx x x 134 
set x x 181 
ss x x 13,14,280 
ssdata x x 108,288 
step x x 15, 18,29,62, 131 
tf x x 13,14,81 
tfdata x x 99 
tzero x x 110 
zgrid x 128 
zpk x x 13,14,82 
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Differences Between MATLAB v5 


and v4 


F.1 


All the MATLAB scripts in the text are based on MATLAB v5 and the Control System 
Toolbox v4, which we will refer to as “v5.” Some users of this text may be using 
MATLAB v4 with the Control System Toolbox v3 or earlier versions, which we 
will refer to as “v4.” This Appendix describes the differences between these two 
versions of MATLAB and is intended to guide the reader with v4 how to interpret 
the scripts in the text. As discussed in the Preface, the MATLAB descriptions in 
the text are not meant to be a replacement to the MATLAB manuals; rather, their 
intent is to lead the reader to the routine in MATLAB that can be used to carry out 
a specific calculation in the text. In some cases it may eliminate the need to look 
at the manual or help file, but in most cases it will primarily point the reader to 
the right routine. 

The most significant difference is the way that systems are described; there- 
fore, that topic will be discussed first in Section F.1 along with a table of some of 
the associated name changes of the routines. Another difference that especially 
affects users of this text is the way that systems are converted from continuous 
to discrete models, a topic covered in Section F.2. The changes in the calculation 
of optimal estimation parameters are discussed in Section F.3. 


System Specification 


In either version, a linear system is specified in one of three forms; transfer 
function, tf, zero-pole-gain, zpk, or state-space, ss. In v4, the tf description was 
specified by numerator and denominator polynomials, often referred to as num 
and den while in v5 both these quantities are appended in one system description 
variable, often called sys. As a result, the routines that require a transfer function 
system specification in v4 require num and den variables in the call sequence 
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rather than the one variable, such as sys. For example, in v4 the impulse and step 
functions described in Section 2.1.6 would be 


impulse(num, den) or 
step(num, den). 


Likewise, if a system is described by the zero-pole-gain form, three variables are 
required and the statements above would be 


impulse(z, p, k) or 
step(z, p, k), 


or, if in the state-space form, four matrices are required and the statements would 
be 


impulse(A, B, C, D) or 
step(A, B, C, D). 
In v5, the statements on page 18 are the same for any model form, that is 
impulse(sys) or 
step(sys). 


To convert from one form to another in v4, conversion routines are used. For 
example, to convert from ss to tf, one writes 


[num,den] = ss2tf(A,B,C,D). 


For other conversions, v4 has ss2zp, tf2ss, tf2zp, zp2tf, and zp2ss for the 
specific conversion required. None of these exist in v5; rather, all conversions are 
accomplished using ss, tf, and zok as shown on page 14. 

The v5 system description includes several properties in addition to the 
differential or difference equations parameters discussed above. The sample time, 
Ts, and the time delay, Td, are two of the important new properties included within 
the system description. If Ts =0, it means the system description is for a continuous 
set of differential equations. If Ts = 0, it means the system description is for a 
discrete set of difference equations. A nonzero value of Td means that the system 
has a pure time delay. Because the system description includes information on 
whether the system is continuous or discrete, separate routines are usually not 
required for the two cases as they were for v4. For example, in v4 there is 
dbode for the discrete case and bode for the continuous case whereas in v5 
there is only one bode for both cases. The routine itself determines from the 
system parameters whether it is continuous or discrete and then carries out the 
appropriate calculations. For example, on page 134 the text states that a bode plot 
can be obtained for a discrete system by using bode. In v4, it would require the 
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use of dbode. Table F.1 lists all of the routines mentioned in this text which have 
been combined into one version for v5. 
A summary of how to adjust for these new features is: 


e Anywhere in the text where there is a routine with an argument consisting of 
a system description (usually some variant of sys), use of that routine in v4 
would require replacing that system description with the specific variables 
for the form of the system being used. For the tf form it could be num, den, 
for the zpk form it could be z,p,k, and for the ss form, it could be A,B,C,D. 


e Separate versions of many routines that existed in v4 for the continuous 
and discrete cases have been eliminated in v5. Therefore, when performing 
calculations with discrete systems, it may be necessary to adjust to the discrete 
version of the routine from that stated in the text. The routines affected are 


listed in Table F.1. 

Table F.1 Selected discrete MATLAB routines reduced to one version for v5 
MATLAB v4 MATLAB v5 
bode, dbode bode 
damp, ddamp damp 
impulse, dimpulse impulse 
initial, dinitial initial 
Iqe, diqe kalman 
Isim, dlsim Isim 
nichols, dnichols nichols 
nyquist, dnyquist nyquist 
reg, dreg reg 
step, dstep step 


F.2 Continuous to Discrete Conversion 


Conversion of a continuous system to a discrete equivalent is first discussed in 
Section 4.3.1 on page 99 and comes up repeatedly throughout the text. This 
conversion was carried out using the C2d routine, which accomplishes the con- 
versions for all cases in v5. To make matters somewhat confusing, there is also a 
c2d routine in v4, but with significantly reduced capabilities in that it would only 
accept systems in the state-space form and would only find the ZOH equivalent. 
Routines with different names exist in v4 to handle other forms, other equivalents, 
and time delay. All these capabilities were rolled into c2d for v5. The situation 
is summarized in Table F.2. 
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Table F.2 


F.3 


MATLAB discrete equivalent routines 


Calculation MATLAB v4 MATLAB v5 


convert ss form using c2d c2d 
'zoh' equivalent method 


convert any form c2dm c2d 
using any method 


convert ss form c2dt c2d 
using 'zoh' method with Td ¥ 0 
with input delay 


Optimal Estimation 


The calculation of the optimal steady state gain for an estimator in Eq. (9.97) 
is performed by MATLAB's kalman routine in v5. This routine also computes 
the complete optimal estimation system equations as part of its duties. In v4, 
the optimal gain is computed by dlqe and it is left for the user to construct 
the estimator equations. There are several versions of LQE calculations in v4 
which have slightly different capabilities (Ige,dlge,Igew,dlgew,lge2,lqed). All 
of these have been replaced in v5 by kalman with the exception of lged which 
was replaced by kalmd. 
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